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NUMBER-THEORETICAL ALMOST-PERIODICITIES.* 


By AurEL WINTNER. 


The present paper deals with an harmonic analysis of various types of 
“hidden periodicities” in the theory of whole numbers. The “ periodicities ” 
in question, the simplest manifestations of which have so much fascinated 
the Greeks, are not of course periodicities proper, and so it is remarkable that 
the corresponding arithmetical recurrences possess at all an harmonic analysis 
in the technical sense of the term. In fact, the “recurrences” prove to corre- 
spond to an almost-periodice behavior in the sense (B) of Besicovich. It is 
revealing for the whole situation that, for instance, Bohr’s class of uniformly 
almost-periodic sequences turns out to be all too restrictive to represent the 
effect of arithmetical fluctuations, except in trivial cases (cases which cannot 
even occur in connection with the “ Eulerian” and “ Dirichletian” algorithms, 
treated in 5 and 9 respectively). 

The emphasis in the results obtained is always on the circumstance that 
the resulting expansions into trigonometric series, instead of claiming formal 
identities, represent Fourier expansions for the arithmetical sequences to 
which they belong. Otherwise the results would have nothing to do with an 
harmonic analysis of the “hidden periodicities” in arithmetics. 


1. By a function A will be meant a function h =h(n) or h=—h» of a 
positive integer, that is, a sequence h(1),A(2),-- or Ay, ho, - The 
symbol (hk) will denote the mean-value 


(1) M(h) —lim (1/n) 


n->X m=1 
provided that (1) exists as a finite limit. 


Consider a linear transformation 


(2) f(m) em(m)g(m) 


of an arbitrary function g into a function f. The latter is defined for an 
arbitrary g if and only if the matrix of (2) is “zeilenfinit” (Toeplitz), that 
is to say such that there exists a function N = N,» having the property that 


(3) €m(n) =0 whenever n>WNm, (m=1,2,:--). 
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For instance, (3) is satisfied by NV», = m if the matrix of (2) is “ triangular ”, 

Suppose that the matrix of (2) satisfies (3) and is such that its elements 
contained in any fixed column form an almost-periodic function (B) of the 
index of the row, that is, that the functions 


(4) *,@m(n),* are almost-periodic (B) 


(the variable is n, while m is arbitrarily fixed). Then the absolute value of 
each of the functions (4) of n is almost-periodic (B). In particular, the mean- 
value M(| ém|), defined by the case h = em of (1), exists for m= 1,2,---. 
If the matrix of the linear transformation (2) satisfies (3) and (4), it is 

easy to verify that any function gq satisfying 

co 
(5) (| em|)| g(m)| < 

m=1 
is transformed by (2) into a function {(m) which is almost-periodic (B) and 


such that every Fourier constant of f(n) is the limit, as ko, of the 
corresponding Fourier constant of f,(n), where 


(6) fe(n) = 3 ¢m(n)g(m) 


in fact, f; will tend, as k > oo, to f in the mean of the (B)-space (cf. [11], 
pp. 24-25). 

Incidentally, it is clear from the proof of these facts that the full force 
of the assumption (3), an assumption which is chosen independent of g, is 
not needed. 


2. Suppose that the Fourier series (B) of each of the functions (4) 
is periodic and that the m-th of these Fourier series (B) has the period m, 
where m=1,2,:--. This means that there exist m constants am(1),°--, 
Gm(m) satisfying 


(7) em(N) %m(j)e(nj/m), 
j=1 

where m is arbitrarily fixed and 

(8) e(x) == exp (2miz). 


On the other hand, none of the functions (4) of n is required to be 
periodic. In other words, (7) is compatible with 


(9) Cm(n) ~e™(n), 


if e™(n) denotes the sum on the right of (7). All that can be said is that, 
besides (7), 


NUMBER-THEORETICAL ALMOST-PERIODICITIES. 175 


(10) om (n) ~ am(j)e(nj/m), 
since 
(11) em(n) 3 am(j)e(nj/m) 


is a finite, and therefore a uniformly convergent, trigonometric sum in n. 
Correspondingly, the Fourier constants %m(j) can be calculated not only by 
the Fourier inversion 


(12) Ma&m(j) = & &m(n)e(— jn/m) 
n=1 
of (7) but also by the Fourier inversion 


(13) (7) — e™(n)e(— jn/m) 
of (11). 

For the purposes of 5 below, it will be fundamental that the possibility (9) 
is not excluded by the assumptions of the following theorem: 


(I) Jf the matrix of a linear transformation (2) satisfies (3) and the 
particular case (7%) of (4), and if g is any function satisfying (5), then the 
function f into which (2) transforms g 1s almost-periodic (B) and has a 
limit-periodic Fourier series (B), 

(8,7r)=1 


(14) f(n) ~ ar*e(ns/r), 


in which the Fourier constants have the values represented by the series 
(15) 
rik 


(each of which is absolutely convergent) ; so that, in particular, 

(16) M(f) =a; = 3 9(k)a%(k). 
k=1 


It is clear from (7) and from the definition of a Fourier constant that 
&m(j)| M(| em |) for every j. On the other hand, the summation index k 
in (5) runs through all multiples of r, that is, the series (15) can be written 
in the form 


Gnr(ns)g(nr). 
n=1 


Hence, if the preceding inequality is applied to m—=nr and j= xr, the 
series (15) is seen to be majorized by the series 


enr |)! g(nr)|. 
n=1 


| 

| 
| 
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But the latter series is a subseries of the series (5), since r 1s any fixed 
positive integer. This proves the parenthetical assertion following (15). 

Of the remaining statements of (I), only the explicit representation (14), 
(15) of the Fourier analysis (B) of f remains to be proved. In fact, the 
almost-periodicity (B) of f follows from 1, even if the restriction (10) of (4) 
is omitted. Correspondingly, the proof of (I) will be complete if it is shown 
that, in virtue of the restriction (10), the function (6) has a Fourier series 
(B) of the form 


(8,r)=1 
(17) fx(n) ~S & a-*(k)e(ns/r) 
r=11=s=r 


(for every fixed &) and that, as k > ©, each of the Fourier constants of (17) 
tends to the limit 
(18) lim a,;*(k) Onr(ns)g(nr), 
k->00 n=1 
that is, to the value (15). In fact, the remark following (6) implies that (17) 
and (18) are sufficient for (14) and (15). 
Substitution of (10) into (6) gives 


mM 


(19) fe(n) ~3 3 am(j)¢(nj/m). 


This can be written in the form (17), if all those pairs j, m are collected in 
(19) for which the quotient j/m has a fixed value s/r, where s and r are 
relatively prime. In other words, (17) is satisfied by 


(20) = an(j). 

j/M=8/T 
But every pair of summation indices j, m occurring in (20) can be represented 
in the form 7 = ns, m = nr, where n is a unique positive integer. It is also 
clear that this n will attain all values 1,2,--- as k—>o in (20). Hence, 
(18) follows from (20) and from the fact that, as shown above, the series on 
the right of (18) is absolutely convergent. 

This completes the proof of (I). 


3. Suppose that the matrix of (2) satisfies the assumptions, (2), (4) 
and (7), of (I). 

Of particular arithmetical interest proves to be the case of those among 
the matrices which have the property that, for any function g satisfying (5), 
the ¢(r) coefficients (15) of each of the interior sums in (14) are identical 
except for fixed phase factors. By this is meant that there should exist for 
every s and every r, where 


’ 
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(21) (s,r)==1 and (r= 1,2,-:-), 


a constant «,-* which depends only on the matrix of (2) and has the property 
that, on the whole s-range (21) belonging to any fixed r, the Fourier constants 
(15) are expressible in the form 


(22) = e(— ar®) by, 


where the coefficients b,,b2,- + + are independent of s, and g in (15) is any 
function satisfying (5). In other words, the Fourier series (14) should have 
the particular structure 


(23) f(n) ~3 brs (n), 


where the trigonometric sums 
(s,r)=1 


depend only on the matrix of (2). 

According to (22) and (15), the matrix of (2) will have this property 
if and only if there exists for every function g—g(m) satisfying (5) a 
function b = b,, satisfying 


(25) by = €(a,*) 


where, for any fixed r, the integer s is restricted only by (21). Clearly, (25) 
can be written in the form. 


(26) 9 (rm) {¢ (a,*) (ns) —€(a,r*) any (nt) }, 


where ¢ denotes any fixed s satisfying (21) with reference to a given r. But 
(26) is required for every g satisfying (5) and, therefore, for every g = 9(m) 
tending sufficiently fast to 0 as m—»o. Since the expression { } multi- 
plying g(rn) in (26) depends only on the matrix of (2), it follows that every 
{ } in (26) rie vanish. But ¢ is a fixed s satisfying (21). Hence, the 
vanishing of every { } in (26) means that, if r,n is any pair of positive 
integers, the product ¢(@,*)¢nr(ns) has a value independent of the choice of 
the #(7) integers s satisfying (21). If r and nr are denoted by d.and m, 
respectively, then this necessary and sufficient condition for the phase rule 
22) appears in the following form: 

For every positive integer, m, and for every divisor, d, of m, there exists 
a value, X= Am(d), such that 


(27) &m(sm/d) = Am(d) 
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holds for all ¢(d) integers satisfying 


(28) (s,d)=1 and 1sXd, where d|m. 


This criterion can readily be applied to an explicit determination of the 
linear transformations (2) in question. In fact, the number of pairs of 
integers s,d, satisfying (28) for a fixed m is 


(8,d)=1 
= 1—3¢(d) —™m. 
d|jm dim 


Correspondingly, the Fourier expansion (7) of m terms can be written in 


the form 
(8,d)=1 


(29) €m(n) &m(sm/d)e(ns/d), 


dim 


whether (27) be satisfied or not. Consequently, the Fourier constants %m(j) 
of the columns (4) of the matrix of (2) satisfy (27) if and only if the 
Fourier expansions (7) can be represented in the form 


(8,d)=1 
em(n) d ) + ns/d), 
which means, by (24), that 
(30) €m(n) ~ 3 Am (d)Ba(n). 
Since (27) reduces (25) to 
(31) b= 


the result of these considerations may be summarized as follows: 


(II) Let the matrix of a linear transformation (2) satisfy the assump- 
tions, (3), (4) and (7), of (1). Then the Fourier series (14) will have the 
particular form (23), (24) belonging to a given set of (not necessarily real) 
absolute constants a,* if and only if there exist absolute constants Am(n) by 
means of which the Fourier series of the columns (4) of the matrix are 
expressible in the form (30); im which case the coefficients of the Fourier 
series (23) of the almost-periodic (B) functions f(n) are given by the abso- 
lutely convergent series (31), where g(n) denotes the function, subject to (5), 
which defines f(n) by (2). 


4. Perhaps the simplest particular case results if (2) is the linear 
transformation corresponding to the sieve of Eratosthenes, namely, 


(32) f(n) ==f(d), 


din 


he 
of 


he 
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where 

(33) g(n) =f'(n). 

The functional notation (33) in the case (32) of (2) is justified by the fact 
that the function (33) is uniquely determined by (32), if f(m) is any given 
function. In fact, if (32) is written in the form (2), then (3) is satisfied by 
Nm =m, and the diagonal element em(m) is 1 for every m. Thus, the linear 
transformation (32) of f’ into f has a unique inverse, which proves the 
assertion. 

It is also seen that, if (32) is identified with (2) in virtue of (33), then 
the m-th of the functions (4) can be described as follows: For every m, the 
function @m(n) of n has the period m and is 0 or 1 according as 1=n < m 
or m= m (in fact, this description of the matrix of (32) is equivalent to 
the sieve process of Eratosthenes). But (8) shows that this tsi of 
the functions (4) is equivalent to the explicit formula 


(34) Cm(n) = (1/m) 3 e(nj/m). 


According to (7) and (11), this means that 


(35) Gm (7) ee 1/m 
and 
(36) = e™(n). 


Finally, it is seen from (1), (7) and from the case j= m of (35), that 
M (em) =1/m. 

Thus it is clear that, if (2) is given by (32) and (33), then the assertion 
of (I) can be formulated as follows: 

(IIL) There exists for every function f a unique function f’ by means 
of which f is representable in the form (32). Suppose that the function f 
belonging to f satisfies the condition 


=1 


(37) 


Then the function f ts almost-periodic (B) and has the Fourier series (B) 

(38) f(n) ~ Sarer(n), 
r=1 

where c,y(n) denotes the Ramanujan sum 


(8,r)=1 


(39) Cr(n) = & e(ns/r) 
1SeSr 
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ond the Fourier constants a; are given by the absolutely convergent series 


(40) Ar = f'(k)/k; 
rik 


so that, in particular, 


(41) M(f) =a, —3f'(k)/k. 
k=1 


This particular case of (1) was found in [11], pp. 24-25 and p. 30. 
Considerations with regard to an “arbitrary” function f which sound like 
(III) but actually either deal with trigonometric identities, rather than with 
Fourier series, or postulate, rather than prove, the existence of a Fourier 
series (38) of some sort are, of course, of a much older date; cf., e. g., the 
considerations of Libri [8] or Carmichael [2]. Incidentally, it is sufficient to 
compare (14) in (I) or (23) in (II) with the uniqueness theorem of a 
Fourier series (B), in order to see how meaningless a postulation of the 
specific expansion (38) in (III) would be. In this connection, cf. a question 
raised by Hardy [6]. 

5. Let now the “Eratosthenian” process (32) be replaced by the 


“ Eulerian” process 
(l,n)=1 


(42) f(n)= = f*(l) 


1=l=n 


(“ Eulerian,” since, in virtue of (42), 


(43) f(n) =¢(n) when f*(n) =—1; 
incidentally, from (39), 
(44) f(n) =cr(n) when f*(n) =—e(nr), 


where r is any fixed positive integer). 
It is clear from (42) that f(1) = f*(1) and f(2) = f*(1); hence 


(45) f(1) =f(2). 


Conversely, if f = f(n) is any function satisfying the initial restriction (45), 
then there exists a unique function f* = f*(n) by means of which f is repre- 


sentable in the form (42). In other words, the functional notation 
(46) 

is justified if (42) is thought of as written in the form (2), provided that 
the trivial necessary condition (45) is satisfied. In fact, if n= 2, then (/,n) 
is 1 for 1 —=n—1 but is not 1 for]—n. It follows, therefore, from (42) 
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that, if n= 2, then f(n) = Fn. + f*(n—1), where Fy_, denotes the sum 
of certain of the n —2 values f*(m) belonging to m =1,---,n—2 (with 
the understanding that the number of these values can be 0, in which case 
Fn-2 = 0). Consequently, if (45) is satisfied, then, since f*(n—1) = f(n) 
— Fy. for n= 2 and f*(1) = f(1), f*(2) =f(1), the values of f* can be 
determined, recursively, from those of f for every n. This proves the assertion. 

Let (42) be thought of as written in the form (2), where the func- 
tion (46) is arbitrary. Then, since Fn. does not contain f(m) when 
m >n—1(> 0), condition (3) is satisfied by Nm = m—1 if n = 2, while 
N, = 1. Actually, the sieve process of Eratosthenes, referred to in connection 
with (32), can readily be adapted to (42) and leads, when (42) is written in 
the form (2), to the following description of the structure of the column 
functions (4): For every positive integer m (including m —1), 


(47) em(n)=0 if lSn<m and en(n)—e™(n) if 


where e”(n) denotes 1 or 0 according as n and m are or are not relatively 


prime (so that, in particular, 


(48) é,(n) = e'(n) = 1 
for every 7). 

Clearly, the function e™(n) just defined is a periodic function of n and 
has m as a period. Although the primitive period of e™(n) is in general 
(namely, unless m is square-free) less than m, it follows that e”(n) can, 
for every m, be represented in the form (11). The coefficients #m(j) of (11) 
can be calculated from (13). This gives 


(8,m)=1 
Mam(j) = e(—-js/m). 


1Ss=m 
But the sum on the right of this relation remains unaltered if —j is replaced 
by j. Hence, from (39), 
(50) &m (7) — Cu(7), m., 
It follows, therefore, from (11) that the function e”(n) occurring in the 
representation (47) of the matrix of the linear transformation (42) of f* 
into f can be given explicitly as follows: 
™m 
(51) e™(n) = Cm(j)e(nj/m) /m. 
j=1 
For the column functions (4) themselves, (51) implies the Fourier 


series (B) 
(52) €m(n) Cm(j)e(nj/m)/m, 
j=1 
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by (11), (10) and (7%); cf. (9) and (47). In particular, 


(53) M (| em |) = M(em) =¢(m)/m. 
In fact, the constant term of the Fourier series (52) belongs to j =m 
and has, therefore, the value cn(m)/m. This means, by (1), that M(em) 
=Cm(m)/m. Hence, (53) follows by placing r—m and n=™m in (39) 
and using the fact that e», = 0. 
If all of this is combined with (I), there results the following theorem: 
(IV) For an arbitrary function f =f(n) subject to (45), there exists 
a unique function f* = f*(n) by means of which f is representable in the 
form (42). Suppose that the function f* belonging to f satisfies the condition 


where @ denotes Euler’s function. Then f is almost-periodic (B) and has the . 
Fourter series (B) 


(55) f(n) 


where the cry(n) are the Ramanujan sums (39) and, if p denotes Mobius’ 
function, the Fourier constants a, are given by the absolutely convergent series 


k 


56 = 
( ) rik 
so that, in particular, 
co 
(57) M(f) =a, 3 f*(k)6(k)/k 
and 
(58) a,==0 unless r is square-free. 


In fact, if (2) is identified with (42), it is seen from (46) and (53) J 
that the assumption (5) of (I) becomes the assumption (54) of (IV). On 
the other hand, (46) and (50) show that the assertion (15) of (1) for (14) 
can now be written in the form 

where (s,r) =1, by (21) or (14). But it is easily verified from (39) and 
from the definitions of Euler’s ¢ and of Mobius’ » that 


(60) Cra(ks) = (kr)u(r)/o(r), where (s,r) =1, 


holds for & = 1,2,---3 cf. (61) below. Consequently, (59) can be written § 
in the form a-* —a,, if a, is defined by (56). Finally, since a-* =a, holds 


H 
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for all ¢(r) integers s satisfying (21), it is seen from (39) that (14) becomes 
precisely (55). 

This completes the proof of (IV). The crucial point was, of course, that, 
just as in (III), the ¢(r) coefficients of the interior sum in (14) proved to 
be independent of s. Correspondingly, both (III) and (IV) can be thought 
of as paradigmata of (II), the class functions B-(n) of the representations 
compatible with (5) being the functions c,(n), | u(1r)| cr(m) in the respective 
cases (III), (IV). In particular, 8,(n) is missing in the second (but not in 
the first) case unless | »(r)| = 1, that is, unless r is square-free. 

It may be mentioned that the elementary identity (60), used in the 
reduction of (59) to (56), is just the particular case of the relation 


(61) cm(m)/p(m) = n)), 


pointed out by Tegnér [10] for any pair of positive integers m,n and their 
icast common divisor (m,n); cf. also Holder [7]. 


6. Any function f satisfying (45) determines a unique pair of functions 
f’, f* by means of which f is representable in the respective forms (32), (42). 
Hence, (III) and (IV) supply two sufficient criteria for one and the same 
property, namely, the almost-periodicity (B), of an arbitrary f. In fact, the 
initial restriction (45), which is necessary in (IV), is quite immaterial, since, 
if f(m) is given for every n > 1, the value of f(1) can be chosen arbitrarily, 
a finite number of values f(m) having no influence on the almost-periodicity 
(B) or on the Fourier expansion (B) of f. 

It will now be shown that neither of the theorems (III), (IV) is con- 
tained in the other: 


(V) The two sufficient criteria which (III) and (IV) supply for the 
almost-periodicity (B) of a function f are independent, that is, condition (37) 
neither implies, nor is implied by, condition (54). 


In fact, if the values f’(1), f’(2),-- - are so chosen that (37) is satisfied 
and the numerical series (40) belonging to r= 4 has a sum distinct from 0, 
then, since (54) implies (58), it is clear that (37) cannot suffice for (54). 

The remaining negation of (V) follows by choosing f*(n) to be 1 or 0 
according as n is or is not 2. Then (54) is satisfied, since only one term of 
the series (54) is distinct from 0. It remains to show that this example does 
not satisfy (37). 

First, since f*(2) =1 and f*(n) =0 for n= 2, it is seen from (42) 
that f(n) is 1 or 0 according as n is or is not an odd number distinct from 1. 
Since (32) is equivalent to Mobius’ inversion 
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(62) 3 u(n/d) f(a) 


for arbitrary mates f, f’, it follows that the present f’(n) is identical with the 
sum }y(n/d) in which d runs through all those odd divisors of n which are 
distinct from 1. Hence, if n is an odd prime p, it is clear that f’(p) is repre- 
sented by the single term yp(p/d) —y(p/p); so that f’(p) =p(1) = 1. 
Since the sum of the reciprocal values of all primes p is divergent, it follows 
that (37) is violated. 

This completes the proof of (V). 

Due to the representation S(n/d) of f’(n) in the last example, it is 
possible to conclude from the prime number theorem that, in this example, 
the series 

(63) (k)/k 

k=1 
is convergent (although (37) is not satisfied). This suggests the question as 
to whether or not (5) always implies the convergence (though not, of course, 
the absolute convergence) of (63). The answer proves to be affirmative. In 
view of (I) and (IV), much more than this assertion is contained in the 
following theorem: 


(V1) If the function f(n) is almost-periodic (B), then (63) ts a con- 
vergent series. 


The point in (VI) is that the assumption of the mere existence of M(f), 
an assumption weaker than the almost-periodicity (B) of f, is insufficient for 
the convergence of (63). This was shown by an appropriate construction 
([11], p. 13), while the question as to the sufficiency of the almost-periodicity 
(B) of f remained there undecided (cf. [11], p. 26). 

If f is almost-periodic (B), the same is true of | f |. Hence, the mean- 
value (1) exists for both functions h =f, h—=|f |. This means that M(f,) 
and M(f.) exist for the linear combinations f, =|f|+f, f.—=|f|—f of 
t,|f|. On the other hand, it is clear from the distributive character of the 
connection (32) or (62) between the f and f’ that, if (63) converges when f 
is replaced by fi, where i = 1, 2, then (63) must converge for f = $(f; — fe) 
itself. But M(f;) exists and f; =0 holds for 11,2, if, without loss of 
generality, f is assumed to be real-valued. Hence, in order to prove (VI), 
it is sufficient to ascertain that the existence of M(f) and the assumption 
if | 0 imply the convergence of (63). But the truth of this implication 
was recently established ([12], p. 6) ; it depends on somewhat more than the 
prime number theorem (cf. [12], p. 9). 

Incidentally, (VI) itself contains the prime number theorem. In fact, 


“ 


= 
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f(m) is 1 or 0 according as n=1 or n>1, then f(n) clearly is almost- 
periodic (B), with 0-+0-+---- as Fourier series, while (62) shows that 
f’(n) becomes identical with »(n). It follows therefore from (VI) that the 
series (63) converges when f’ is Mobius’ function ». But this is known to be 
equivalent to the prime number theorem. 

In the above proof of (VI), only the existence of M(f) and M(|f |) 
was needed. Hence, (VI) can be generalized as follows: 


(VII) Jf f is a function for which both M(f) and M(|f |) exist, then 
(63) is a convergent series. 

That (VII) actually is more general than (VI), follows from the exist- 
cnce of functions f which attain only the values 0 and 1 but fail to be almost- 


periodic (B), although M(f) exists ([11], p. 28). 


Remark. Corresponding to Mobius’ inversion (62) of (32), there is an 


explicit inversion formula of (42), (45), namely, 


(64) f*(n) = Sf(d)— f(4), where n>1; f*(1) —f(1). 


din din- 


In fact, it is clear from (42) that 


But (65) is equivalent to (64). 
Incidentally, (32) shows that (65) can be written in the form 


(66) f(n) = F’(n), 
where 
(67) F(n) — 3 f*(k). 


Since (62) is equivalent to (32), the explicit form of (66) is 
(68) f(n) = Sp(n/d)F(d). 

din 
Thus (42) is identical with (68) in virtue of (67). 


7. In view of (II1) and (IV), it seems to be worthwhile to mention 
an instance of a Fourier expansion (14) which fails to be of the particular 
form of the Ramanujan series (38) occurring in (III), (IV), although it 
belongs to an almost-periodic (B) function f(n) of classical arithmetical 


significance : 


(VIII) Let f(n) denote the characteristic function of the set of the 


le 
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I 
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positive integers which are sums of three squares. Then f(n) is almost- 
periodic (B) and has the limit-periodic Fourier series (B) 


(69) 


where q is odd, e(x) = exp (2mix) and 


(70) dq = — log {1 -- €(q/8)}, 
if the imaginary part of the logarithm is chosen between + zr. 


It is understood that by the characteristic function of a set of positive 
integers is meant the function f(n) which is 1 or 0 according as n is or is not 
in the set. 

Let u and v be fixed positive integers. Then, since 


u 

«(nj/u) 

j=l 

is w or 0 according as n is or is not divisible by wu, the characteristic function 
of the arithmetical progression 


(71) utv, 


is represented by the trigonometric sum 


(72) (1/u) vj/u)e(nj/u). 
j=l 

According to Legendre, the sums of three squares are the integers of the 
form 4*-?(8m—1), where and m=1,2,---. Let f*(n) 
denote the characteristic function of the subset which results when & is fixed 
and m varies. Then, if f(n) is defined as at the beginning of (VIII), it is 
easy to prove that f is almost-periodic (B) and that, as a matter of fact, the 
k-th partial sum of the series ft + f*-+-- - tends to f in the mean of the 
(B)-space, as k-—> co ([11], p. 27). Hence, corresponding to the transition 
from (17) to (14) in the proof of (1), the Fourier analysis (B) of f can be 
obtained by collecting the amplitudes which belong to a fixed frequency in 
the Fourier expansions of the & functions f1,---,f* and then letting 
k—»c. But this calculation can readily be carried out, as follows: 

The set having the characteristic function f* was defined to consist of the 
integers 4*-1(8m—1), where & is fixed and m=—1,2,---. Clearly, this 
set is identical with the case u = 21, v = — 4*" of the arithmetical pro- 
gression (71). Hence, the Fourier series (B) of f(n) can be obtained by 


aC 
r=1 
q 
| | 
4 
3 


ive 
not 
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substituting w= 2°**1, y = — 4* in (72) and then performing summation 
over all positive integers k. Consequently, the Fourier series (B) of f(n) is 


o2k+1 


k=1 j=1 


Since e(m) = 1, this means that there exist certain coefficients b,? satisfying 


(74) f(n) ~ 32243 


r=1 1595 


In order to prove (69) and (70), let r,q be two positive integers the 
second of which is odd and less than 2". Then 0,4 in (74) is the sum of those 
coefficients 2-**-*e(7/8) of (73) for which the quotient j/2*** multiplying n 
has the fixed value g/2". Since (q, 2") = 1, these pairs of summation indices 
j,k can be parametrized by placing 7 = qm and = where m runs 
through all positive integers. Consequently, ; 


m=1 
It follows therefore from (8) that (since the expansion — log (1— 2) 
co 
= mz” remains valid on |z|—1 if 
m=1 
(75) b-4 = — 2° log {1 — e(q/8)}, 


if the determination of the logarithm continuously is derived from that of 
log (1— where |z| <1 and log1—0. 

This completes the proof of (VIII), since (74) is identical with (69) 
in virtue of (75) and (70). 


8. Theorems (III), (IV) and (VIII) have been deduced directly from 
(1), rather than from the complicated criterion (II). However, all that is 
responsible for the complications in (II) is the fact that the sufficient criterion 
supplied by (II) is necessary as well. Thus it ‘will be convenient to isolate a 
simple particular case of (II), supplying a sufficient criterion which contains 
not only (III), (IV) and (VIII) but is also applicable, for instance, to the 
various sequences of Kloosterman’s sums. 


(IX) Let e,(n), es(n),-+: be a sequence of functions which are almost 
periodic (B) and have the following property: There exist constants 


(76) a*, where (s,r)=1,lSsSr and r=1,2,:-:-, 


such that the Fourier series (B) of the functions e,,é2,--++ of n are 
expressible in terms of the sequence B;,B2,- + ° of the trigonometric sums 
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(77) Br(n) = e(— + ns/r) 
in the form aed 

(78) ex(n) ~ Ba(n) 
(which is equivalent to | a 

(79) Br(n) ~3 u(r/d)ea(n), 


by Mobius’ inversion). Then, if g(n) is any function satisfying 


co 
(80) M(| ex 9(&)| < 


the function f(n) defined by the linear transformation 


f(n) = en(n)g(n) 


is almost-periodic (B) and has the Yourier series (14) of the particular type 
(82) f(n) ~3 DrB-(n), (B), 


in which the (“shifted”) Fourier constants b, are given by the absolutely 
convergent series 
(83) by = 3 g9(k). 

In order to see this, let (81) be identified with (2). Then (3) is satisfied 
by Nm =m. On the other hand, due to the possibility allowed in (11), (7) 
and (9), it is clear from the device (47) applied in the proof of (IV), that 
the identification of (81) in (IX) with (2) in (1) is admissible (in (47), 
the assignments n < m and m =n now become nm and m < n respec- 


tively). Hence, if Am(k) in (30) is chosen to be 1 for every m and every k, 
it is sufficient to go over the assumptions and assertions of (II) step by step, 


in order to obtain (IX). 

The formula corresponding to (41) in (III) is somewhat obscured in 
the generalization (IX) of (III), although (83) corresponds to (40). First, 
it is clear from (77) that only the first of the trigonometric sums B,(n), 
B2(n),: contains a term independent of n, and that is identical 
with this term, that is, with the constant e(— ,'). It follows therefore from 
(82) and (1) that M(f), instead of being represented by the case r= 1 of 
(83), has the value 


(84) M (f) = = 
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This also explains an apparent paradox in the wording of (IX), namely, 
the absolute convergence of the series (83) whenever (80) is satisfied. In 
fact, if all but one of the values g(1),9(2),°-- are chosen to be 0, it follows 
irom (81) and (84) that 


(85) M(e,) = M(e2) =- = €(—a@,") 


(this can be seen from (78) also). But 1/(|h |) cannot be less than | M(h)|, 
and the exponential function e occurring in (85) cannot become 0. 


9. All of the arithmetical algorithms (2) considered thus far led to 
Fourier expansions (B) which, instead of having the general limit-periodic 
form (14), induce a “class condition,” expressed by the phase rule (22) or 
by the corresponding reduction of (14), in (1) to (82), (77) in (IX). One 
might expect that this will be the case for every algorithm (2) of classical 
arithmetical interest. But it turns out that there is at least one classical 
algorithm, of substantially arithmetical significance, in which (14) cannot be 
contracted into (82), (77). It results if the “ Eratosthenian ” and “ Eulerian ” 
substitutions, (32) and (42), are replaced by the algorithm 


(86) 


which underlies Dirichlet’s divisor problem ([3]; cf. [4]). 

In (86), the bracket [ ] is the symbol of the greatest integer not 
exceeding the quotient n/m. Hence, if (86) is thought of as written in the 
form (2), where 


(87) g(n) = 


then (3) is satisfied by Nm—m—1 for m—1,2,---. It is also. seen 
from (86) and (2) that, while 


(88) f(1)=0, =0, 


f(n) will contain f°(n—1) for every n > 2. In other words, if n > 2, then 
f(n) is of the form f(n) = + @n-1(n) f°? (rm —1), where is a linear 
form in the n — 2 values f°(m) preceding f°(n — 1), and the absolute con- 
stant en_,(n) does not vanish. Consequently, there exists for every function 
f(n) satisfying the initial restriction (88) a function f°(m) by means of 
which f(n) is representable in the form (86) for every n, and f°? is uniquely 
determined by f and by an arbitrary assignment of an initial value 


(89) fo(1). 
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For every positive integer m, let e™—e™(n) denote the non-negative 


function assigned by 


(90) em(1) =1/m, e™(2) —=2/m,:- -, 
e™(m—1) = (m—1)/m;  e™(m) =0 


and by the requirement that e"(n) = e™(n + m) for every n (it is under- 
stood that, if m —1, the assignment (90) gives 


(91) e(n) =0 


for every n). In terms of these functions e"(n), define the functions ém(n) 
by (47). Then it is clear that (86) and (87) can be written in the form (2). 
Since the function e» is non-negative and becomes identical with the periodic 
function e” after the first period of e», it is also clear that M(| em |) = M(em) 
exists and is identical with the arithmetical mean of the m values (90). Hence 


m-1 


M (| em |) j/m?—> 4 


j=1 

as m—>o. Consequently, (5) is satisfied if and only if the series formed by 
the values (87) is absolutely convergent. This leads to the following theorem 
fin which the Fourier expansion (B) does not have the phase properties 


specified above) : 


(X) For an arbitrary function f =f(n) subject to (88) and for any 
choice of an initial constant (89), there exists a unique function f° = f°(n) 
by means of which f is representable in the form (86). Suppose that the 
function f° belonging to f satisfies the condition 


| f°(k)| < 


1 


(92) 


Then f is almost-periodic (B) and has the Fourier series (B) 


(8.r)=1 
(93) f(n)~> & a,*e(ns/r), 
r=1 
in which the Fourier constants are given by the absolutely convergent series 
r|k 


where the X denote absolute constants 


k-1 
(95) Ar? (k) = ne(—ns/r). 
1 


n= 


The sum (95) denotes 0 when it is vacuous, that is, when k = 1; cf. (91). § 
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First, the assumption (92) of (X) was seen to be identical with the 
assumption (5) of (1). Next, substitution of (90) into (13) gives 


m-1 
(7) = ne(— jn/m). 
n=1 
Hence, from (15) and (87), 


k-1 
= k-*f(k) Xme(— (ksn/k)/r). 
n=1 


But this can be written in the form (94), if A-*(%) is defined by (95). 
Accordingly, (X) follows by applying (I) to the case of Dirichlet’s 
algorithm. 


10. If 1S and n—1,2,---, let denote the residue 
which remains when n is divided by m. In view of the description of the 
matrix of (86) in 9, it is natural to raise the question as to the asymptotic 


behavior of the average fluctuation in the set 


(96) ™m(1), tn(2),° Ta(™),° 


of the subsequent division residues of n, as n—> oo. For a fixed n, this average 
fluctuation is meant to be the arithmetical mean of the absolute deviations of 
the consecutive elements in the ordered set (96) of n non-negative integers, 
that is, the arithmetical mean 


(97) dn = (1/n) tn(m + 1) —rn(m)|. 


. 


The value of 7,(m-+ 1) for m=» in (97) is, in the main, immaterial as 
n—> oo; it can naturally be defined by 


(98) m(n+1) =n, 
since the least non-negative residue (modn-+1) of n is n. 
An answer to the question is contained in the following fact: 
The average fluctuation of the consecutive division residues (96) of n ts 


of the order of log n; in the sense that, if dn is defined by (97), 


(99) 0 < lim inf ¢,/log n S lim sup ¢n/log n < ow. 
OO 
It will remain undecided whether or not this answer is of a final nature, 
since the approach to be followed will neither prove nor disprove the possibility 
that the (positive and finite) lower and upper limits coincide in (99). 


As recently proved in another connection ([11], pp. 11-12), 
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n | [2] m [ n | m+1 
= 
m=1 m jn m--1 n 


On the other hand, if em(n) denotes, as in 9, the coefficient of f°(m) in the 
n-th of the equations (86), then, since ¢m(n) =n/m— [n/m], it is clear that 


Mem(n) — (m + = m[n/m] — (m+ 1)[n/(m+1)]. 


But it is also clear from em(n) = n/m— [n/m] that 


> 4 log n — const. 


Tn(m) = men(n), 


since [n/m] denoted the integral part of the quotient n/m, while rm(n) was 
defined as the least non-negative residue of » (mod m). The last three formula 
lines imply the first of the inequalities (99), since ¢, is defined by (97). 

In the direction of the remaining assertion of (99), it is clear from the 
last two formula lines that 


| rn(m) —rn(m + 1)| S| [n/m] — [n/(m + 1)] | m+ [n/m]. 


Since [n/(m-+1)] is non-negative and does not exceed [n/m], the sign of 
absolute value is superfluous on the right of this inequality. Consequently, 
summation with respect to m gives 


(100) Non S & [n/m]m [n/(m +1)]m-+ [n/m], 
m=1 m=1 1 


m= 
gn being defined by (97). If m is replaced by m — 1, the second of the three 
terms on the right of (100) appears in the form 
n+1 
— 3 [n/m](m—1), 


which, since [n/m](m—1) vanishes for and for m=n-+], is 
identical with 


n n n 
—3n/m(m—1), ie., —S[n/m]m+ [n/m]. 
m=1 m=1 m=1 


Accordingly, (100) can be contracted into 


Non 2 [n/m]; so that Pra S231/m, 


m=1 m=1 


since [n/m]/n=1/m. It follows therefore from = 1/m~ logn that the 


m=1 


proof of the last of the inequalities (99) is complete. 
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A PARTITION FUNCTION WITH THE PRIME MODULUS P > 3.* 


By JoHN Livincoop.** 


1. Recently Lehner [2]* developed a convergent series for p,(n) and 
pz(n), the number of partitions of a positive integer n into summands of the 
form 5/ + 1 and 5] + 2 respectively. The purpose of this paper is to develop 


a similar series for p,(n), po(m).° +, the number of partitions 
of a positive integer n into summands of the form pl+1, pl+2,:--, 


pl + (p—1)/2, p being a prime greater than 3. Here, as in the previous 
paper, we follow the method of Rademacher [5]. 


We consider the generating functions 


Fi(z) = 1—2z')1 » Da(n)a” 
]==+ a (mod p), (a=1,2,---,(p—1)/2 


convergent within the unit circle. To determine the asymptotic behavior of 
F,(x) near a rational point on a circle concentric to the unit circle but in- 
terior to it, we subject x to the transformation +— 2’ where 


(1. 2) == exp(2mih/k —2wz/k), =v’ = exp (2mih’/k — 2x/kz). 


Here ®@(z) >0, h and & are coprime integers satisfying < k, and 
h’ is any fixed solution of 
(1. 21) hh’ =—1 (mod k). 


We then derive a functional equation connecting F'a(2) and F,(2’) for all k 
divisible by p (6 =1,2,- - -, (p—1)/2 depending upon h). If p{k, we write 


(1.3) = exp (2rtH’/k — 2x/Kz), HH’ =—1 (mod k) 


and find a similar relation between F,(x) and certain new functions H,(2”). 
Certain sums of roots of unity appear. The application of the Rademacher 
method requires that these sums be reduced to incomplete Kloosterman sums, 


so that a better estimate than O(/) will result. 


* Received April 24, 1944. 
** The author wishes to express his appreciation to Professor Hans Rademacher 
for suggesting this problem and for giving much helpful advice in its solution. 

1 Square bracket numbers refer to the bibliography. 
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We now introduce the Cauchy integral and apply the transformation 
3 equations (3.93) and (3.94). If p|k we obtain 
(p-1)/2 
Pa (n) = og(h, k)exp(-— 2aihn/k) 
b=1 
oO 
(1.4) f > po(v) exp (2rih’v/k) 
d y=0 
e X exp{—(2/k*w) (2v — B/6p) + rw (2n — A/6p) }d¢, 
OShCESN, B= p?—6pb + 
o Next the integrand is split into two parts, depending upon the sign of the 
coefficient of w-'. We note that this coefficient is positive for v= 0,1,-- -, 
” [B/12p]. For certain p, v0 is the only admissible value. This is the 
result that Lehner obtained for p= 5. It is also true for all p= 1%. How- 
ever, for other p, there may be more admissible values for v. For- example, 
if p19, v0 or1. This interesting information results in the extension 
of the previous work. The contributions furnished by the case p|& when the 
coefficient of wt is negative, and by the case p{k, follow as in the special 
case p= 5. 
Finally we obtain asymptotic ratios for p,(n), po(n),° 
in- 
I. The Transformation Equations. 
2. The first object is to derive a transformation equation for F(a) in 
and (1.1). Consider the case p|k. Fa(x) is regular, without zeros, in the unit 
circle; therefore log F,(«) is single-valued in the same region if we choose a 
specific branch of the logarithm, for example, the one given by log F(0) = 0. 
Then 
ll k x a0 
(2.1) Go(x) = log Fa(x) =— S log(1—z') (2'"/m), 
+a (mod p). 
Placing 
«= exp(2rih/k) X X, X—exp(—y), >0 
cher § (2. 2) m = gk + =n 1, 25° °° 
l=rk+a, 1SA<k, A=+a (mod p) 
in (2.1) we have 
(2.21) Ga(x) S exp(2rihprA/k) exp{—(rk +A) (qk + 2) y} 
acher N rq qk +p 


where A, », 7, and g assume the values described above.” 


* Throughout this paragraph, \, 4, gq and r assume these values. 
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Application of the Mellin formula to (2.21) gives 


T'(s) ds 
Ga(z) = ~ ex thprA/k 
( ) p(2 pa/ ) ( / gk (rk: +- 2d) 8 (qk + p) sy8 
oo 
= exp 2rthprA/k ) (1/221) (r + d/k)-s > + 
(a) y r=-0 q=0 


r 
(2.3) Ga(z)— Dexp( A/k)E(1 + 8, w/k) ds 
where 
1 


is the Hurwitz Zeta function, and where the («) under the integral sign means 
that the path of integration is from «—io to a+ i0. 
Recall the Zeta functional equation 


9 k 
{(s,A/k) =T(1—s) {sin(as/2) cos 2adv/k €(1 — s, v/k) 
v=1 
(2. 4) 
+- cos (7s 5/2) sin - £(1 —s,v/k)}. 
Applying (2.4), replacing* exp (2rihpA/k) by cos (2ahpA/k) and 


exp (2zihpA/k) by isin(2rhpA/i:) in the first and second terms of the right 
member of (2.3), and setting z= (yk/27). we have, after simplification 


Dig 28 cos 78/2 
(2. 41) 
+ Hsin’ sin 
X 


+8, 


(a) sin 2 


x 


where exp(slogz) and where we can take | & logz| < 7/2 since 


(z) > 0. 
We now define p by setting 
(2.5) h’p (mod k), 0<p<k 


where h’ is given by (1.21). This, together with (1.21), shows that 


(2. 51) p=—hdA (mod k). 


* This is permissible since we sum over \ and since \ = +a (modp). 
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In addition, since p|k, (1.21) implies hh’==—1 (mod p) and hence, by 
Fermat’s Theorem, 
h’ =— h?? (mod p). 


Since p|& and A= -+ a (mod p), (2.51) implies 
(2. 52 p=+tha=+ (mod p). 
Introducing the new summation letter p and replacing s by —s, we have 


= (1/4zik?) cos 2xpp/k cos 2rh’pv/k 


8 
) cos 18/2 


 £(1—s,n/k) +8, v/k) 
x 


+ (1/4rk*?) sin 2app/k sin 2h’ pv/k 


(-a) 


2-8 sin /2 


where the restrictions on p are given by (2.5) and (2. 52). 

The appearance of cos zs/2 and sin zs/2 in the denominators enables us 
io shift the path of integration from (—a) to (a), taking into consideration 
the residues at the various poles. Doing this and comparing the result with 
(2.41), we find two expressions of the same form, except that we now have p 
instead of A, h’ instead of h, » and vy interchanged, and z* instead of z. 
Hence, using the definition (1.2) of 2’, we obtain 
(2. 7) Ga(x) = G,(2’) — 2wi(R, + 
where R, and R. are the sums of the residues of the first and second terms 
respectively in the right member of (2.6) with the sign of integration moved 


to the extreme left in each case. 
These residues turn out to be the following: * 


(2.8) R, = Az/12%ukp — B/12ikpz; 
k-1 
R,=—4>3 ((A/k))((AA/k)), = + (mod p) 
Az=1 
where 
(2. 81) A = p? — 6pa + 6a’, B= p? — 6pb + 60°, 


((x)) [2] —44 48(2). 
Thus, by use of (2.7%) and (2.8) and by taking exponentials, we obtain the 
desired transformation equation for the case p|k: 


‘Calculation of the residues follows as in Lehner’s paper. 


/Ie)-* 
/Te) ds 
(2.6) 
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(2. 9) Fa(@) == k) exp{ (x/6kp) (B/z — Az) }Fy(2’). 

Here z and 2’ are defined by (1.2), b by (2.52), A and B by (2.81) and 
(2. 91) wa(h, k) = exp {rioa(h, k)} 

with 


(2.92)  oa(h,k) -3 ((A/k)) ((hA/E)), (mod p). 


3. Now we consider the case p{k. Here Fa(x) does not transform into 
F(z’) ; however, F(x) will be transformed into a certain new function by a 
method similar to the preceding one. Choose 2”, H, H’, K as in (1.3) and 
place 

=exp (2rih/k) XX, (—y), >0 
(3.1) m= qk + p, 
+A, x, A=-+a (mod p). 


As in 2, we find ® 


Ga(z) = (1/4mkK) cos 2xhprA/k cos 2rdrAv/ K 
AN wr 


f (1+ 8, 
(a) 


cos 18/2 


A 


+ (1/4ckK) sin 2xhpdr/k sin 


€(1+ 58, 
(v= 1,2,°°-°,K). 
Define A* by setting 


(3. 3) A= pH’r* (mod k), 
where H’ is any fixed solution of HH’=—1(modk) and H=ph. We 
observe that (3.3) implies AX*==—/A (modk). Introducing A*, replacing 


s by —s, and shifting the path of integration as in 2, we obtain 
Ga(x) = (1/4mikK) cos 2rpr*/k cos 
p/ke)€(1 + s,v/K) 


x ds 
(a) COS 
(3. 4) + (1/4rkK) sin 2xpaA*/k sin 
(a) sin 


5 Throughout this paragraph, A and w assume the values given in (3.1) and » 


assumes the values 1,2,...-,A. 


(3.2) 
! 
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In order to obtain the new function into which F,(zx) is transformed, 
we now retrace our steps from (3.4). Replace cos (27Av/K) by exp (27tAv/K) 
and sin (2rAv/K) by i" exp (2riAv/K) and apply the transformation equation 
for {(s,A*/k). After simplification, we find 
= (1/2miK) exp (2ridv/K) 
A p 
x (8)/(2nk)*)£(s, + 8, ds. 
to ‘ 
a Application of the Mellin formula gives 
ad 
= = > exp (27iAv/K) (rK + {—(22/Kz) (qk + A*) (rK + 
r,q=90 
(3. 5) 
= — > D log {1 — exp [—(2x/Kz) (qk + A*) + 2md/K]}. 
N 
Examination of (3.1) reveals that since p{k, \ runs over a-complete 
residue system modulo & twice in some order. We therefore define « by 
(3. 6) ka =a (mod p), O<acp. 
This, along with (3.3), yields 
(3. 61) As pH’r* + ka (mod K). 
Inserting (3.61) into (3.5), we obtain 
La(x) = > log {1 — exp [—(22/Kz) (qk + 
A qd 
Place 
(3. 71) gk + Il, exp Pp) = 9(%), exp(— =7(2). 
Ve — Using these notations and (1.3), we find 
ng x x 
La(x) = — > log (1 — n(2)2”') — log (1 — 
as 
(3. 72) La(z) =| LT (1— n(a)a’’?) == log Ha(2”) 
=1 
where 
x x ae 
y=1 
We now calculate the residues as in Lehner’s paper, and find 
R, = Az/12ikp — 1/12tkpz + log (2 sin xa/p) ; 
(3. 8) 
K-1 
ly § R, = — ((hdA/k)) ((A/K)), A= +a (mod p) 


A=1 
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where A and ((x)) are given by (2.81). Therefore, by use of (3.4), (3. 72) 
and (3.8) and by taking exponentials, we obtain the desired transformation 
equation for the case p{k: 


(3.9) Fa(xz) =4yal(h, k) ese ra/p exp { (1/2 — Az) }Ha(v”) 

where A is given by (2.81), Ha(x) by (3.73), @ by (1.2), 7” by (1.3) and 
(3. 91) xa(h, k) = exp {rita(h, k)} 

with 


K-1 
(3.92) ta(h,k) => ((hdA/k))((A/K)), A\=-+a (mod p). 
A=1 
From (2.9) and (3.9) we have the following result: 
THEOREM 1. The function F.(x), defined in (1.1), satisfies the trans- 


formation equation 


(3. 93) Fa(exp [2xth/k — 2xz/k]|) 
= exp [ (4/6kp) (B/z — Az) | Fo (exp [2aih’/k — 22/kz]) 


for the case p|k, and the equation 


(3. 94) Fa (exp [2rth/k — 2xz/k]) 


= dya(h,k) esc exp [ (x/6kp) (1/2 — Az) ]Ha(exp [2mriH’/k —% 


for the case 


II. The Kloosterman Sums. 

4. When the transformation equations (3.93) and (3.94) are inserted 
into the Cauchy integral, certain sums of the roots of unity wa(h,k) and 
xa(h,k) appear. The estimate O(/:) for these sums is not sufficient for the 
application of Rademacher’s method. Thus, we now reduce these sums to 
incomplete Kloosterman sums, subject to the estimate O(n%k%**), according 
to Salié. 


Consider the case p|k. From (2. 92) 
k-1 
oa(h,k) = ((A/k)) ((hA/k)), A= +a (mod p). 
A=1 


However, by the same procedure as that used by Lehner,® we are able to 
replace this by 


k-1 
(4.1) oa(h,k) = (A/k) ((hA/k)) — (b/p—3$), A =a (mod p) 
=1 


where b is given by (2.52). Employing the definition of ((7)) we obtain 


®See [2], pages 644-645. 
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k) — (2h/k*) — (2/k) — (1/k) (b/p—4), 
=a (mod p) 
which upon simplification becomes 
6pkoa(h, k) = 2h{2k? +- 3k(2a— p) + A} 
(4. 2) — 3k(k — 2p + 2a + 2b) —12p S A[hA/E], 
A 
=a (mod p) 
where A is given by (2.81). Examination of (4.2) reveals that 6pkoa(h, k) 
is always integral. Also, 
6pkou(h,k) = 0 (mod 3) if 
(4. 21) 
6pkoa(h, k) = 2a + 2b + 2p —3 (mod 4) if & is odd. 
Next we shall evaluate a certain sum in two different ways,’ and ‘obtain a 
result containing o,(h, &), from which we can find the residues of 6pkoa(h, k) 


for moduli which are multiples of k. Now 


(4. 31) = ((hA/k))? ((p/k))? (p/k —4)* = (B— /6pk + 4 

p p 

A=a(modp), p==—b (mod p). 

Also 

((hA/k))? = (hA/k — [hd/k] — 4)? 

A A 
(4. 32) = hoa(h,k) + h(b/p— 4) — (h?/6pk) {2k? + 3k(2a— p) + A} 

+ ((hA/k] +1) +431; A =a (mod p) 
\ 


by the use of (4.1). Comparing (4.31) and (4.32) we find 


(4. 4) 6phkoa(h, k) = h?{2k* 3k(2a— p) + A} — — B) 
3hk (2b — p) —12pkT 


where 
T= 4 [ha/k ] ([hA/k] +1), (mod Pp) 
A 


is always integral. Let 12»—/G, where f is the greatest divisor of 12p 
which is prime to k. We obtain the following results: 


(k,12p) = p, f=12,G=—p (k,12p) = 3p, f=4,G=— 3p 
(4.5) (h,12p) = 2p | (k,12p) = 
3, , f— 1, Gm 12p. 
(k,12p) = 4p §’ f P (k,12p) =12p P 
Denote by h’ a solution of hh’==—1 (mod Gk). Since all primes in Gk, 


this congruence always has solutions because (h, k) = 1 implies (h, Gk) = 1. 
Multiplying (4.4) by —h’ and remembering that Gk|2k?, we obtain 


7 See [6], proof of Theorem 3. 


| 
1 
| | 
—h 
| 3 
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(4.61) 6pkoa(h, k) = hu — h’v — 3k(2b — p) (mod Gk) 
where 
u = 3k(2a— p) + A, v= B— k’, 


From (4.21) and (4.5) we have 


(4. 62) 6 pkoa(h, k) =6a + 6b + 6p — 3 (mod f). 
Set 
(4. 7) (mod Gk), = 1 (mod f). 


Then, from (4.61) and (4.62) we find 
(4. 8) 6 pkoa(h, k) =fo{hu — h’v — 3k (2b — p)} 
+ GkI(6a + 6b + 6p — 3) (mod 12kp). 

Thus, finally, 

wa(h,k) =exp {arioa(h, k)} = exp { (2rt/12kp) 6pkoa(h, k)} 
(4.9) = exp {2mi| (¢/Gk) (hu — h’v) -— (346/G) (2b — p) 

+ (T/f) (6a + 6b + 6p-—3)]}. 
5. We now consider the case pfk. From (3.92) and an argument 

similar to that of Lehner’s, we have 


ta(h, kb) (A/K) ((hA/k)) = 2S (A/K) (ha/k — [ha/k] 
A\=1 A=1 
(5.1) — 4-4 (mod p). 


As in 4, we obtain, upon simplification, 
6pkta(h, k) = 2h{2K? + 3K (2a—p) + A} 
(5. 2) — 3k(K — p + 2a— 2a) —12 SA[hA/E], 
A 
=a (mod p) 


where « is given by (3.6). Examination of (5.2) shows 


6pkta(h, k) = 0 (mod 3) if 3¢k 
(5. 3) 6pkta(h, k) = (p— pk + 2a + 2a) (mod 4) if & is odd 
6pkta(h, k) = 0 (mod p). 


As in 4, we evaluate ((hA/k) )*, A=a (mod p), K—1 


in two different ways, and upon comparing results find 


(5. 4) 6pkta(h, k) = h?{2K? + 3K (2a—- p) + A} 
— (k? —1) + 6hka —12kT” 


where 


T’ =} p> +1), A=a (mod p). 


nt 
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Let 12p = Fg, where F is the greatest divisor of 12p which is prime to k. 
Then 
(k,12p) 3, F=4p,g=— 3 


] 
(5.5) (k,12p)—=2) , (k,12p) = 6 
(k,12p) = 4 (k, 12p) = 12 ,F=p, g=12. 


Denote by h’ a solution of hh’ =—-1 (mod gk). Multiplying (5.4) by —h’ 
and remembering that g|12 and g|2k, we obtain 

(5. 61) 6pkta(h, k) =hu— h’v 6ka (mod gk) 

where 


u= 3K (2a—p) +A, v=1— Fk’. 


From (5.3) and (5.5) 


(5. 62) 6pkta(h, k) = 9p? (2a + 2a + p— pk) (mod F). 
Setting 
(5.7) Fé =1 (mod gk), gky =1 (mod F) 


and combining (5.61) and (5. 62), we find 
(5. 8) 6pkta(h, k) = F@(hu — h’v + 6ka) 
+ Igkyp*(2a + 2a + p— pk) (mod 12pk). 


Therefore 
xa(h, k) = exp {aita(h, k)} = exp { (2ai/12pk) 6pkta(h, k) } 
= exp (/gk) (hu—h’v) + 6ba/g + (9p?y/F) (2a + 2a -+ p— pk) 
By choosing pH’ = h’ (mod gk), and using G = pg, H = ph we obtain finally 
(5. 9) xa(h, k) = exp {2at| (@/Gk) (Hu — p*H’v) 
++ 6ba/g + (Ip*y/F) (2a + 2a + p— pk) }}. 


6. Consider the sum 


A(n,v3k3a,b3 01,02) =r =D oa(h, k) exp (— 2ai(hn — h’v) /k) 
(6.1) h 
h mod k, h = bB (mod p); 0, Sh’ 025k; pk. 


Here 3’ means the sum over all integers prime to the modulus of the system, 
and h’ denotes any solution of hh’ ==—1(modk). Examination of (2.81) 
shows wa(h,k) to be of period k. Because of the periodicity modulo k, we 
can change the modulus to Gk and select that h’ which satisfies hh’ =—1 
(mod Gk). By inserting the value (4.9) for wa(h,k#) and removing the 
restrictions on h and h’, we find that t= O(n%*k***). Hence, we secure the 
following result: * 


* For details of this development, see [2], pages 649-651. 
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THEOREM 2. The sum 
A(n,v3k3 a,b; 01,02) = wa(h, k) exp (— 2ai(hn — h’v) /k) 


(6. 2) 
hmodk, h=bB(modp), 


in which the parameters are all integers, n >0,k >0, p|k,OSo, < Sk, 
and a@=1,2,---, (p—1)/2 is subject to the estimate O(n*k%**) uniformly 


in Vv, a, b, 01, 


By a similar method, and by use of the equations G= pg, H = ph, 
HH’ =—1 (mod k), we obtain our next theorem. 


THEOREM 3. The sum 

B(n,v3k3 01,0234) = xa(h, k) exp (— 2ai(hn — H’v) /k) 
6.3 
hmodk, HH’ =—1(modk), Sh’ <a 


has the estimate O(n%*k%**) uniformly in v, 01, o2, a. 


III. A Convergent Series for p.(n). 


7. We now apply the Hardy-Littlewood method, with the sharper esti- 
mate of Kloosterman and Rademacher,’ to the generating functions Fy(2). 
We obtain 


Pa(n) = (1/2ni) (Fa(x) dx 


= >” exp (— 2zmihn/k) f Fy(exp [2aih/k — 2x(z/k)]) exp (2znwu) 
We now apply the transformation formulas, and write 


(7. 21) Pa(n) = pa? (nr) + pa’? (n) 


where pq‘) (n) is the sum of all terms for which p|& and pa‘*)(n) is the same 
for p{k. Choose 8 such that a8==-+ 1 (mod p). This together with (2. 52) 
shows that 

h = bB (mod p). 


Then we obtain from (7.1) and the remarks above 


® See [5]; the notations here are those of Rademacher. 
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(p-1) /2 
Pa) (n) = D>’ oa(h, k) exp (— 2ai(hn/k) ) 
b=1 
6” 
(7.22) f exp exp {—(w/k*w) (B/6p)) 
+ nw (2n— (A/6p))}dd, 


The sum over v is now split into two parts, Q(n) and R(n) depending upon 
B and a fortiori 6 such that in Q(m) the coefficient of wt is positive and in 
R(n) the coefficient of wt is negative. R(n) contributes nothing to the final 
result; in fact, 

(7.3) R(n) =O (exp ( crN-*)n*N-**), 


Referring to (7.21) we note that Q(n) contains all v such that vy S [B/12p]. 
If B is negative, the sum over v is vacuous. For positive B and v= 0,1,---, 
[B/12p] 

0 if 


Here p,(v) denotes the number of partitions of v into parts congruent to 
+ b (mod p) and since 


— Spb + 66° 


4/1207 = 
v< B/12p 6p 


= 4(p/6—b + b*/p) < p/12 

those parts that must be congruent to + b (mod p) must be 0b itself, for 
p—b or p+ b is too large (p—b > p/2 > p/12 >v). Also, B= 0 means 
b? — pb + p*/6 = 0, i. e., b? — pb + p?/4 = p?/12. Thus, the only permissi- 
ble b for Q(n) are 6 —1,2,---,[p(3— V3) /6]. Setting v= bp, we find 


n(3-\/3) 6] [B/12pb] 


Q(n)= DS BD ea(h, k)exp(— 2axi(hn/k) ) exp (2zih’bp/k) 
p= 


bel 


(7. 4) 
exp (2p — (B/6p)) + xw(2n— (A/6p))} a6, 
J 
h = bB (mod p). 


Evaluating Q(n), we find 


N [p(3-V3) /6] [B/12ph] 
(7.5) Q(n) > > Axo(n) (n) + O(exp(crN-*) N-%**) 
A=1 p=0 
where 


k, 
ly 
| 
3 
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(7. 51) Axo(n) = A(n,v;k; a,b; 0, k) 
and 
(B—12pbp)* — A)*(B—12pbp)* 
J-(12pn — A)% 3pk 
(7. 52) 
(B— 12pbp)* a (A —12pn)*(B— 12pbp)* 
t(A—12pn)% 3pk for on < 
Now 
(n) = 4 xa(h, esc ra/p exp (— 2ari(hn/k) ) 
hk 
(7. 6) 


x > Ca(v) exp (2rtH’v/k) exp{—(ax/Kkw) (2v— (1/6) ) + rw(2n— (A/6p) ) } dg, 
v=0 


As before, we split this sum into two parts Q’(n) and R’(n). Here Q’(n) 
contains only the value v0. We find, as before, 


(7%. 7) R’(n) =O (exp 
and 
N 
(7.8) Q’(n) =m dese (wa/p) By(n)L’,(n) + O (exp 
k=1 
where 
(7. 81) B,(n) = B(n, 0; k; 0,k; a) 
and 
1 — A)*% or 
(7. 82) 
1 § — 12pn)% 
3pk for n< A/12p. 


Upon combining (7.3), (7.5), (7.7), and (7.8), keeping n fixed and 
letting N — o, we obtain our desired result. 


THEOREM 4. The number, pa(n), of partitions of a positive integer n 
into positive summands of the form pl +a (a=1,2,-+-, (p—1)/2) ts given 
by the convergent series 


[B/12p] 
pa(n) = > Ax,»(n) Ix (n) 
b=1 p=0 
Dit 


(7.9) 


+a > By(n)! ese rak/p | L’x(n) 
k>0 


pik 


/12p 


‘12p, 


}dg, 
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where Ly(n) and L’;,(n) are given by (7.52) and (%. 82), Axo(n) by (7.51), 
and B,(n) by (7.81). 


8. To determine the asymptotic ratios of the pa(n), we have to investi- 
gate the dominant terms in each of the series of (7.9). Inspection of these 
series clearly shows that the leading term in each series dominates all others. 
Thus, we must have 
(8.1) k= p, p=0 if plik 

keel if 


In brief, the dominant terms are 


(12pn — A)*B,% 


in the series for p|& and 


T 
(8. 3) ( 12pn —A)# B,(n) | csc | I, 


a (12pn — A)* 
3p 


in the series for pf k. 


From (2.81) and (8.1), we find 


B, = p?— 6p + 6. 


Hence, 

(8. 4) B,/p? = = =1—6/p+6/p? <1. 
Therefore 

(8. 5) m(12pn — A)%B,% 2(12pn— 


3p 
From the theory of the Bessel functions, we obtain the following estimates *° 


I,(z) = O(2), <1 
I,(z) ~ exp (24z)-%, |2| >1. 
By virtue of (8.2), (8.3), (8.4), and (8.5) we note that the term origi- 


nating from the series for the case p{k dominates that originating from the 
series for the case p|k. Hence we obtain 


10See [8], page 203. 
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Pi(n): po(m): Pw-1)/2(”) 
| ese | I 3(12pn—A,)* 
(12pn—A,)*% 3p 


| ese 2ar/p | I (12pn— 
(12pn—A,)* ~* 3p 


| ese (p —1)x/2p | I § m(12pn — A 


« 


Applying (8.6), we finally obtain 


—~cscm/p : +: esc (p—1)x/2p. 
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ON THE LACK OF AN EUCLIDEAN ALGORITHM IN R(\/61).* 


By L. K. Hua and W. T. SHIH. 


It has been proved by A. Brauer * that the fields #( Vp), pa prime = 13 
(mod 24), are non-Euclidean, except possibly for p = 138, 37, 61 and 109. In 
the cases p = 13 and 37 the fields R(V p) are Euclidean. Thus it is an open 
question whether the fields R(V61) and R(V109) are Euclidean or not, 


TirkEoREM 1. The quadratic field R(V61) is non-Euclidean 
It is known that Theorem 1 is a consequence of 
THEOREM 2. For any integers x, y we have always 


l(a (1/2)y — 1/78)? — (61/4) (y —17/39)?] > 1. 


Let 
u= 78a + 39y —1, v = 39y — 17%, 


If Theorem 2 is false, then we have integers wu, v and k such that 


(1) — 61v? =k, where | k | S 78° = 6084, 


Since 
(2) u==—1, v ==— 17 (mod 39), 
we have 


k == 1° + 17*=0 (mod 39) ;s 
furthermore since 


k= (y+1)?— (y+1)?=0 (mod 4), 


k 
we have that & is a multiple of 156. (== 4 & 39). Moreover, evidently (4) 
= 1, since / is not a multiple of 61 (in fact 61 X 156 > 6084). By these 
two properties we may list all the possible values of 7, where & = 156] and 
39: 


0, +1, +3, #4, 45, +9, 412, +13, +14, +15, +16, 
+19, + 20, + 22, +25, +27, 434, + 36, + 39. 


* Received April 25, 1944. 
1 Brauer, American Journal of Mathematics, vol. 62 (1940), pp. 697-716. 
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For /=0, + 14, = 22, + 34, (1) is insoluble. In fact, e. g., if 


u? — 6lv? = + 14 X 156, 
then 
u* = 61v? (mod 7), 


and (=) = — 1, which is impossible. 
We now give the solutions of (1) for these possible values of k. 
LemMa 1. The numbers of primary? solutions of (1) are given by the 


following table: 


l | +143 +445 +9 +12 +13 +15 +16 +19 +20 +25 +27 +36 +1 
no. of pri- | 
mary solu- 4 648 8 6 6 12 4 8 8 12 10 8 


tions of (1) | 


The proof of the lemma is a straightforward calculation of 


> 
nlk\ 


which is the number of primary representations of k. 

LemMa 2. If u®—61v0°=k has no primary solution satisfying 
(2’) + 1, v= +17 (mod 39), 
then u? —61v? = +k has no solution satisfying (2’). 

Proof. Since »= (39+ 5V61)/2 is the fundamental solution of 
u* — 61v? = — 4 (in fact 39?— 615? ——4), the solutions of u*— 61v’ 
= are given by 


2 2 2 ae 


where Uo, Vp is a solution of (1). 


For 1, we have 
(3) u=+4(39u + 5 X 61r), v= + + 
i.e. 
+17%U, (mod 39), 
i.e. 
Up = + 16, Vo = +17u, (mod 39) 
(since 16 17==—1 (mod 39)). If w==+1, v=+17, then we have 
Up vo = +17 (mod 89). 4 


1, S. 140. 


2? Landau, Vorlesungen iiber Zahlentheorie, Bd. 
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The proof is easily completed by repeated application of the same 


argument. 


Thus in order to prove Theorem 2 it remains only to establish that there 


is no primary solution of (1) with (2) and 0 < 1539. 


LemMMA 3, All the primary solutions of (1) are given by the following 


table: 
l 
1 20,2; 41,5; 1484, 190; 3335, 427. 
oa 3 23,1; 38,4; 282, 36; 633, 81; 5147, 659; 11567, 1481. 
bile 4 40,4; 82,10; 2968, 380; 6670, 854. 
5 29,1; 32,2; 151, 19; 337, 43; 883, 113; 1984, 254; 12473, 1597; 
8 16136, 2066. 
9 60, 6; 123, 15; 245, 31; 548, 70; 977, 125; 2195, 281; 4452, 570; 
10005, 1281. 
12 46,2; 76,8: 564, 72; 1266, 162; 10294, 1318; 23134, 2962. 
13 9%, 11; 208, 26; 463, 59; 1672, 214; 3757, 481; 8443, 1081. 
15 49,1; 73,7; 134,16; 171,21; 293,37; 378,48; 2367, 303; 3062, 392; 
5319, 681; 6881, 881; 13957, 1787; 31366, 4016. 
16 80, 8; 164, 20; 5936, 760; 13340, 1708. 
19 55,1; 67,5; 483, 55; 970, 124; 1165, 149; 2617, 335; 21283, 2725; 
35935, 4601. 
20 58, 2; 64,4; 302, 38; 674, 86; 1766, 226; 3968, 508; 24946, 3194; 
32272, 4132. 
25 83, 7; 100, 10; 161, 19; 205, 25; 527, 6%; 1181, 151; 1259, 161; 
2828, 362; 7420, 950; 9599, 1229; 16675, 2135; 21572, 2762. 
27 69, 3; 114, 12; 191, 23; 419. 53; 846, 108; 1899, 243; 3851, 493; 
8654, 1108; 15441, 1977; 34701, 4443. 
36 120,12; 246,30; 490,62; 1096, 140; 1954, 250; 4390, 562; 8904, 1140; 
20010, 2562. 
39 78, 0; 105, 9; 322, 40; 715, 91; 1447, 185; 1603, 205; 3250, 416; 
7303, 935; 26430, 3384. 
By Lemma 1, the table contains all primary solutions. The verification 
is straightforward. 
Moreover no solution contained in the table satisfies (2’). Thus Theorem 
2 is established. 
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THE PROBLEM OF MILLOUX FOR FUNCTIONS ANALYTIC 
THROUGHOUT THE INTERIOR OF THE UNIT CIRCLE.* 


By Maurice H. HEIns. 


1. Introduction. Let EF denote a point set of the z-plane lying in, and 
closed relative to, | z <1, having the property that for all F# satisfying: 
0=R <1 the intersection of Z with | z|—R is not void. A function f(z) 
will be said to belong to the class Fm, if f(z) is defined, analytic, and of 
modulus less than one for | z| <1 and if there exists an F such that ze H 
implies | f(z)| << m, where m is a given positive number less than one. Let 
M(f;7) have its usual significance as max | f(z)|. It is required to determine 


lu.b. M(f;r) (0<7r<1) and the associated extremal functions. 
Fm 

This problem is solved completely in the present paper; it will be shown 
that the extremal function suitably normalized is unique and independent of r. 
The normalized extremal function will be determined and expressed with the 
aid of the Jacobi theta functions. Two methods of obtaining the extremal 
function are indicated. One involves the theory of Blaschke products; the 
other gives an algorithm for determining the extremal function by the re- 
peated composition of (1,2) directly conformal maps of the interior of the 
unit circle onto itself. 

The methods developed are applicable to other problems. Analogues of 
Tchebycheff polynomials in the hyperbolic plane are defined and determined. 

The problem of the present paper is closely related to a series of in- 
vestigations initiated by Milloux [9], who showed that, if f(z) is analytic 
and of modulus less than or equal to one for | z| <1 and if there exists a 
Jordan are J lying in |z| <1, except for one endpoint on | z|—1, and § 
joining z—0 to the circumference | z|==1 with | f(z)| Sm (<1) for § 
zeJ, then there exists a positive constant k, independent of m and J, such that 


Lower and upper estimates for the best value of k have been given by Landau § 
[8]. Subsequently, E. Schmidt [14] considered the problem of Milloux under § 
the weaker hypotheses which assumed: (a) f(z) to be analytic and of modulus 
less than one for ze where § = [| z| <1]-CJ,J having the same connota- 


* Received January 11, 1944. 
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tion as above; - ue | f(z)| Sm, and showed the existence of a positive k, 
again independent of m and J, such that (1.1) held for the f(z) considered 
and ze. In fact, he showed the best value of & for the problem he treated 
to be 1/m and indicated that this was also the best value of & when f(z) is 
actually defined and analytic throughout |z| <1. The problem was later 
studied under broader hypotheses by R. Nevanlinna [12], Beurling [2], 
Fenchel [4], and Milloux [10], but always with the view to obtaining in- 
equalities having universal constants independent of m and the set H where 


lim | f(z)| Sm. 


It should be emphasized that the present investigation is concerned with 
functions analytic throughout the interior of the unit circle and deals with 
m fixed. 


2. Preliminary reductions. Let g(z) be defined, analytic and-of modu- 
lus less than one for |z| <1. Recall that g(z) can be represented * by 


where »(@) is defined and monotone non-increasing for 00S 2z, d is a 
real constant and z(z) is defined to be (a) identically equal to one if g(z) 


has no zeros for | z| <1, (b) the finite product 
n 
(2. 2) 
| | L— 
if g(z) has a finite number of zeros %,° - +, %n, multiplicities being counted 


as usual (if any % —0, then the corresponding factor is to be replaced simply 
by z), (c) the Blaschke product ? 


(2. 3) 
| | | 1 — az 

if g(z) has an infinite set of zeros %,, %,° - -, the same conventions prevailing 


here and throughout the discussion as in the case of the finite product (2.2). 

Suppose now that fo(z) « ¥» is extremal and that Ey is a corresponding 
E-set.2 By means of a trivial normalization it may be assumed that fo(r) 

This is essentially the representation due to Herglotz [6]. Cf. Evans [3] and 
R. Nevanlinna [11] for accounts of this representation. The use of the Poisson-Stieltjes 
representation is not necessary in this connection but is suggestive froma physical point 
of view. 

* An exposition of the theory of Blaschke products as well as of the products (2. 2) 
is to be found in Julia [7]. 

*The proof of the existence of an extremal function and an associated E-set is 
readily established and will be omitted. 
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=M(f.;r). The object of the present section is to show that, subject to this 
normalization, the set [—1< «=0] may be regarded as an associated E-set 
of fo(z). Define *(z) to be identically equal to one if f(z) is free from 
ZeTOS ; 


2.3 

if fo(z) has a finite number of zeros 2,° * -,2n; 

2.4 * = 

(2. 4) i+/als 

if fo(z) has an infinite set of zeros 2:,22,---. Now let h(z) denote 
1—z 1 (*. 

2. = 7* x ‘a 

(2.5) h(2) = exp { 


where »(@) has the connotation of (2.1) for fo(z). If ze Eo, then 


(2. 6) |h(—| 2|)| S| fo(z)|, 
and 
(2. 7) h(r) = fo(r), 


equality prevailing in (2.7) if and only if h(z) = fo(z). These facts may 
be readily ascertained by the use of standard inequalities for expressions of 
the form 


=~ a] <4) 


1— az 


and for the Poisson-Stieltjes integral (inequalities of the Harnack type). 
Inequalities (2.6) and (2.7%) imply, respectively, that h(z) «Fm with 
[—1<2=0] as an associated H-set and that h(z) is extremal. Since the 
strong inequality prevails in (2.7) if h(z) Afo(z), it follows that, if f,(z) 
is extremal, then h(z) =f,(z), and hence 


THEOREM 2.1. If fo(z) is extremal and is so normalized that M(fo; 71) 
=fo(r), then [—1< 240] its an associated E-set. 


8. The qualitative characterization of the extremal function. If f,(z) 
is extremal and normalized as in 2, then fo(z) is also an extremal function 
for the following problem. 


Prostem. Let f(z) be analytic and of modulus less than one for | z| <1. 
Further let it be required that | f(z)| Sm<1, m being a gwen positive 


— 


Ly 
of 


THE PROBLEM OF MILLOUX. 215 


number less than one, for z real and non-positiwe (—1< 250). Finally 
let r be real and lie strictly between zero and one. It is required to determine 


| f(r)! 
{f} 


and the associated extremal functions. 


This problem has been studied qualitatively by the author,* the pertinent 
results being summarized in 


THEOREM 3.1. If f(z) is so normalized that f(r) > 0, then there eaists 
an extremal function, ¢(z;m,r), and it is unique. It has the following 
properties : 


(a) $(2;m,r) is a Blaschke product ; 

(b) (2; m,r) has an infinite number of zeros, all simple and negative; 

(c) m,r) is real for z real; 

(d) on the real axis between successive zeros of ¢(z;m,r) there exists 
a unique point where 

(e) $(0;m,r) =m, ¢’(0;m,r) >0; 

(f) ¢’ has simple zeros at the points whose existence ts affirmed in (d) 
and has no other zeros; 

. (g) the zeros of ¢*>—m?* are z=0 and the points defined in (a), and 

there are no others. 


Let it be agreed that G[w(z)] will denote the Riemannian image of 
|z| <1, with respect to w(z), where w(z) is understood to be analytic for 
<1. 

From Theorem 3.1 the structure of @[¢] can be readily inferred. It 
follows from (f) that G[¢] has branch-points only over —m and m. Con- 
sider now the Riemann domain S lying over | w| <1 which is defined as 
follows. Let S; denote the open simple disk | w | < 1 slit along the real axis 
from — 1 to — m, and let S; (k = 2) denote copies of the open disk | w| <1 
slit along the real axis from —1 to —m and from m to +1. The Riemann 
domain S is constructed by joining the upper bank of the slit of 8, to the 
lower bank of the left-hand slit of S. and the lower bank of the slit of S, to 
the upper bank of the left-hand slit of S2, then by joining in the same fashion 
the banks of the right-hand slits of S. and S3, then the banks of the left-hand 
slits of S; and S,, and so forth. The Riemann domain, S, obtained in this 


‘Cf. M. H. Heins [5]. 
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fashion is simply-connected and bounded. Hence by the fundamental mapping 
theorem of conformal mapping there exists a function w= ©®(z;m) which 
is analytic and of modulus less than one for | z| <1, which satisfies the 
normalizing conditions: 


©(0;m) =m, ®’(0;m) > 0, 


and maps | z| <1 onto the Riemann domain 8. 
Consider now the relation 


(3. 1) ®[y(z) ;m] = m,7) 


where y(z) is normalized to vanish at z—=0. It is readily inferred from the 
structure of S==G[#] that there exists a function, ¥(z), which is analytic, 
single-valued, and of modulus less than one for | z| <1, which vanishes at 
z= 0, and satisfies (3.1) identically for all z in |z| <1. Next, from the 
structure of S and the normalization requirements imposed on ®, it may be 
inferred that ® is real and monotone increasing for z real and positive, and 
that ® is real and of modulus not exceeding m for z real and negative. Finally, 
y(z) is real and positive (in fact monotone increasing) for z real and positive. 
Hence ®(z;m) is a competitor in the extremal problem proposed at the 
beginning of this section. Also y(r) Sr (0 <r<1) where equality prevails 
if and only if ¥y(z) =z. From (3.1) and the enumerated properties of 
and y it follows that 


(3. 2) O(r;m) = o(r: mr) = S 


Hence = 1, ®(z;m) =¢(z;m,r), and S=G[¢]. One important con- 
sequence of this result is 


THEOREM 3.2. The extremal function $(z;m,r) is independent of r. 


For the remainder of the present paper the extremal function will be 
denoted simply by $(2;m). 

The information obtained about the structure of G[¢] will be very 
valuable for the explicit determination of ¢(z;m). An immediate application 
of the present results is a recurrence relation for ¢(z; m) which may be found 
as follows. Consider the function $*(z) defined by 

o(z;m) +m 


(3. 3) [¢*(z) $*(0) > 0. 


The function ¢*(z) may be continued to be analytic throughout | z| <1, 


since at the points where ¢(z;m) ——~m it is true that ¢$’(z;m) has a 
simple zero. The Riemann domain, ([¢*], has the same ramification proper- 
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ties as G[¢] except that the branch points are now over V2m/(1-+ m?) and 
— V2m/(1-+ m*). The conditions (3.3) imposed on ¢*(z) imply that the 
derivative of ¢*(z) at z—0 is positive. Hence $*(z)==$(z; V2m/(1-+ m’*) ) 
and (3.3) yields 


4, The determination of ¢(z;m) by a process of successive composi- 
tions of rational functions. In this section an algorithm will be developed 
for determining ¢(2;m) as the limit of a sequence of rational functions. One 
noteworthy feature of this algorithm is that each member of the sequence 
provides a majorant for ¢(r;m) (0 <r <1) and hence furnishes incidentally 
an upper bound for M(f:r) where fe Fm. 

Instead of considering ¢(z;m) directly, it will be convenient to ted 


o(z;m) —m 
1 — mp(z; m) 
The structure of the Riemann domain, G[w], is the same as that of G[¢] save 
that the ramification points of G[w] lie over 0 and —2m/(1-+ m?). Let 
p denote 2m/(1-+ m*). Define a) by the relation 


+h, 7]? o(z) +h 

] + (2) ) po(z) 

and the requirements: w,(0) = 0, > 0, being real and so determined 
that these conditions are fulfilled.’ It is clear that, subject to these con- 
ditions, /,* = and on differentiating (4.2) with respect to z that 


> (0) +h, _ 
“1+ho(0) [1+ [1 + p0(0)]? 


(4. 2) 


or 
2h, (1 — = (1— 


Hence k, >0 and is equal to Vu. With k, so specified, o:(z) is uniquely 
determined as a function analytic and of modulus less than unity for | z| < 1. 
A study of the Riemann domain, G[w,], yields the result that G[o,] has the 
same structure as G[w} but with its ramification points lying over 0 and 
—(2Vp/(1+p)); (2Vp/(1 +2) will be denoted by From (4. 2) 
it follows that ,(z) is real, monotone increasing and positive for z real and 
positive since w(z) itself has this property. But then from (4.1) and (4.2) 


5It is to be observed that this is essentially the device used by Koebe for estab- 
lishing the fundamental mapping theorem. 
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y+ 
(4. 3) +m 1+ 1— mp 


i—m) 
2 
1+ Vpo, 


and since, by Schwarz’s lemma, w:(r) <r for r real and positive, a rational 
majorant for ¢(r;m) is given by replacing w,(r) by r. Hence 


( r+ Ve 
M+ Vers 


1—m 
1+ Ver 


The process applied to »(z) to determine w,(z) may now be applied to 
w,(z) with Vm, 0:(2), o2(z) replacing p, ki, w(z), respectively, 
in (4.2) and the conditions w2(0) —0, w’.(0) >0 replacing the analogous 
conditions on w,(z). This process can be continued step-by-step indefinitely 
yielding a sequence of functions {w,(z)} and a sequence of positive constants 
{vx} =0,1,- - where 


(4. 4) d(r3;m) < 


(4. 5) (4 =0,1,-- -); 
worn + 7? oK + pk 
( ) ) 1 + ) 
and 
(4. 7) w,(0) = 0, >0 = 0,1,- -). 
Let 9x(Z) be defined by 
1+ 
4,8) (Z)= = k=0,1,: °°) 
( qx(Z) ( 
1+ 


and let Q.(Z) be defined by the relations 
(4.9) =Qealge(Z)] (kK = 1,2,° 


= 


nal 
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It is an immediate consequence of these definitions that 


(4. 10) o(2;m) = (k =0,1,° °°). 


An argument of the type used to deduce (4.4) yields the following sequence 
of majorants for ¢(r:m) (r real and between 0 and 1): 


(4. 11) d(r;m) < = (0,1,---). 


It is natural to conjecture that 


Qu(z) +m 
1 + mQi.(2) 


This may be proved from properties of Q(z) resulting solely from its defini- 
tion. From (4.9) it follows that the degree of Q:(z) is 2**%. Second, consider 
the image of the interval [— 0] with respect to Qx(x). Fork =0, 
the image of [— 0] with respect to Qo(z) is [—»SuS0]. This 
may be seen by observing that the minimum value attained by Qo(2) for real 


6(z;m) =lim for |z| <1. 


zis —yp and that this minimum is attained at x = — Ve which evidently 
satisfies the inequality 


+n <— Vp < 0. 
To obtain the image of [—peS 20] with respect to Q,(zx), recall that 
Q:(z) =Qolqi(z)] and observe that the image of [—p2S20] with 


respect to 9:(x) is [— pw. SX = 0]; this last remark follows by the argument 
used on Qo(x) and [—y, S20]. Hence the image of [—pweSrS0] 
with — to Q:(x) is [—p»SuS0]. By observing that the image of 
[— S with respect to qx(x) is [— pe S X S 0] and recalling the 
defining relations (4.9) the proof of the original assertion is established. 


To complete the proof of the conjecture, note that lim yz, —1 (this fact 
k->00 


follows from the elementary observation that {yx} is a positive monotone 
increasing sequence with limit »* = 1, and that p* = 2V w*/(1+ »*)). Let 
{Qxcn)(z)} denote any subsequence of {Qx(z)} which converges for | z| <1 
and let Q*(z) denote the corresponding limit function. Since for each k 
(k =0,1,2,---) the image of [—pri S20] with respect to is 


[—»SuS0] and since an pe = 1, the limit function Q*(z) must have 


the property that the image of [—1< 20] with respect to Q*(z) must 
belong to the interval [—p»SuS0J]. Hence the function 
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Q*(z) +m 
i + mQ*(z) 


must be a competitor in the extremal problem of 3. But then it is clear that 


Q*(r) +m — o(r;m), 


1+ mQ*(r) — 
and on the other hand (4.11) implies 


*(r) +m 
o(r;m)S i 
In other words, 


$(z;m) j + mQ*(z) 


Now the sequence {Qx(z)} does converge. If it did not, there would exist 
two subsequences with distinct limit functions. However the argument just 
applied to Q*(z) shows that any limit function must coincide with 


m) —-m 
1— md(z;m) 


Hence lim Qx(z) exists for |z!< 1 and stands in the indicated relation 
k->00 


to ¢(z;m). 
Ox(r) + 
4.1. If f(z) Fm, then M(f;r) < (k = 0, 1, 2, 
-). Further, the sequence {Qx(z)} converges continuously for | z| <1 and 


lim Q:.(2) m 


k>x 
k-00 

5. The representation of ¢(z;m) asa Blaschke product. Let B(z;m) § 
denote ¢(—2*;m). Clearly B(z;m) is a Blaschke product which is even 
and has all its zeros simple and real. Another consequence of the definition 
of B(z;m) is that | B(z;m)| has m as its maximum between successive 
zeros of B(z;m) (for z real). The Riemann domain, G[B(z;m) ], is com- 
pletely ramified (the covering is always twofold locally) over m and —~m. § 
Hence if a denotes the smallest positive zero of B(z;m), and if 7 denotes § 
the linear fractional transformation of | z| <1 onto itself with fixed points 
—1 and + 1, which carries — @ into «, then 


(5.1) 
The zeros of B(z;m) are therefore {T*a} (k =0,1,2,---;—1,—2,--:°). 


B(T;m) =— B(z;m). 
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For by (5.1) 7*« is a zero of B(z;m) for every integer k. Nor can there 
be any other zero since such a zero would have a homologue with respect to 
some integral power of 7 strictly in the open interval (—«,a) and this is 
contrary to the definition of « It now remains to determine @ and T in 
terms of m. 

The transformation 7’ may be expressed in the form 


Tz—1 z—l1 


(5. 2) (A real, O<A< 1). 
The condition, 7 (— %) a, implies that = Hence T* is given by 
i=)" 
(5.3) \ita z-+1’ 
or 
1 2)" 
1+2 
(5. 4) 1— a \** 
1+2 


Since B(z;m) is an even Blaschke product which is positive for z= 0, it 


must necessarily be given by 


(5. 5) B(z3;m) = 
k 


The value of @ is determined by the condition B(0;m) =m. In other words, 


the relation 


(; 2 
(5.6) mon (Te)? 
k=0 k=0 1 


must hold. It is readily verified that there exists a unique real « lying between 
0 and 1 which satisfies (5.6) for a given positive m less than 1. This may 
1+a 
carries the interval (0 < « <1) into itself and that the infinite product 


oo — 
(7) I 1 
is analytic for | +! < 1 and defines a monotone strictly decreasing map of the 
interval (0 < + <1) onto itself. As a matter of fact, the inversion of P(r) 


be deduced by observing that the linear fractional transformation « 


| 1s a classical problem in the theory of theta functions. 
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To recast the present problem in terms of the theory of theta functions, 
recall the following definitions and identities.° The functions and @;(w) 
with parameter 7 (| 7| <1) can be defined by 
(5.7) 6,(w) = 1 — 27 cos + 27* cos 4ru — 27° cos 6ru +° 

=1-+ cos 2xu + 2! cos 4ru + 27° cos Gru 
For each + satisfying | + | < 1 the functions @,(u) and 6;(u) are entire in u, 
For the present purposes the following identities are significant: 

ee) 
6,(u) =C (1 — 27°*** cos 2au + ; 
k=0 


(5. 8) 
(u) =C II (1 + 27r7** cos + 


where C is independent of z and is obviously not zero—its exact value is of no 
concern for the problem at hand—and 


64(0) —cTl (1 —— 
(5. 9) 
6,(0) = cil (1 2, 
From (5.6) and (5.9) 
(5. 10) m = 0,(0) /03(0) 


where the parameter 7 is equal to (1—«a)/(1-+). The formulas (5.8) 
will be employed in 6 to represent ¢(z;m) in terms of the functions 4,(w) 
and 6;(u). 

With « assumed determined as a function of m, ¢(z;m) may be ex- 
pressed immediately as a Blaschke product. The definition of B(z;m) and 
the formula (5.5) imply 


(Tia) 
(5. 11) $(2;m) = i + 


is monotone 


(Pha)? + 2 
Note that the sequence {¢y(z)}, where ¢y(z) 


decreasing for any z real, positive, and less than 1. Hence ¢y(r) furnishes a 
rational majorant for M(f;r) where f ¢ Fm for N =0,1,2,---. By way of 
comparison with the methods developed in 4, it is to be observed that the 
determination of the coefficients of ¢y involves the transcendental operation of 
computing « in terms of m, whereas the coefficients of Qx(z) may be found 
by éssentially “ algebraic ” operations. 


* Following the notation used by Whittaker and Watson {16]. 


| 
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6. The representation of ¢(2; m) in terms of theta functions. With 
t= (1—a)/(1+ 4) formulas (5.4) and (5.11) imply 


— \2 
co +2 
(6. 1) o(2;m)= 1] 
k=0 1+ 
or 


(1 + 2) + —- + +z) 


1+ 2 
and by (5. 9) 
(6. 2) m) = 64(u) /O3(u) 


where + is the parameter of both @(u) and @,(u) and cos 2ru 
= (1—z)/(1+2). 


7. A hyperbolic analogue of the Tchebycheff polynomials. It has 
been observed in 5 that G[B(z;m)] is completely ramified over m and —m, 
having only branch points of order one over these points. Recall the structure 
of the Riemannian image of the finite z-plane with respect to w = cos z, which 
is completely ramified over + 1 and —1 with branch points of order one, and 
which is locally simple over every other point of the finite w-plane. On the 
strength of the analogy between the structure of G[B(z; m) ] and the Riemann 
surface for cos z, it is natural to conjecture that in many respects B(z; m) 
plays much the same role in the theory of functions which are analytic and 
bounded in the interior of the unit circle that cosz plays in the theory of 
entire functions. The similarity of the roles played by B(z;m) and cosz 
will be brought out in two extremal problems to be considered. The first is 


ProspteM. To determine among all (1,n) directly conformal maps, 
tn(z), of | z| <1 onto itself those for which 


max | mn(z)| 
-rSrsr 


ts least and the corresponding écart. Here n 1s a given positive whole number 
and r is a given positive number less than one. 


9-2k+1 1—z 4k+2 
) 
) 
) 
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This problem is a hyperbolic analogue of the problem of determining the 
Tchebycheff polynomials from their extremal properties. 

The first question that must be settled is that of existence. Let p(n) 
denote g.l. . [ max | wn(z)|]. The class of functions consisting of all con- 

stants : an one and all directly conformal maps of | z| < 1 onto itself 
which are of degree not exceeding n is compact. Hence it may be shown by a 
familiar argument that there exists a member of this class, 7*,(z), of degree i 
(0=k=n) for which 
(7.1) max | x*,(z)| p(n). 
The degree & cannot be less than n. Otherwise the function 2"*x*;(z) would 
belong to the class {7n(z)} and would be such that 
max | < p(n). 

Hence there exists an extremal function in the class {rn(z)} for the present 
problem. A typical extremal function will henceforth be denoted by 7*n(z). 

Qualitative properties of a typical extremal function may be readily 
determined. Of central importance among such properties for determining 
p(n) and the extremal functions themselves are: 1° the location and the 
nature of the zeros, 2° the location of the points on (—r=2zZ=r) where 
|r*,(z)| attains p(n). Recall that a directly conformal map of 
| z| <1 onto itself may be represented by 


(7.2) =e (6 real; | a; | <1, &==1,2,---,n). 
k=1 1 — 


With the aid of this representation it can be shown that all the zeros of 7*n(z) 
are real. This will be demonstrated by using the inequality 


| 
1— (R2)z 


1— ae 


< 


(| Ra|<|a|<1) 


(7.3) 


where = is real and satisfies —1<2< 1. The inequality may be established 
by direct analytical verification or else by noting that the non-Euclidean circle 
centered at z and tangent to the Euclidean straight line @z—= 0a does not 
contain « in its closed interior. On referring to the representation (7.2) one 
can see that by virtue of (7.3) if any zero of 7*,(z) is not real, u(m) would 
be diminished if that zero were replaced by its real part. 

Furthermore, every zero of 7*,(z) must lie in (—r<2<r). Suppose 
that a, a zero of 7*,(z), satisfied |a| =r. Then, if « were replaced by « = pa 
where p is real and less than but near one, the value of »(m) would be 


THE PROBLEM OF MILLOUX. 225 


diminished. Also the zeros of z*,(z) are all simple. To see this suppose that 
B is a multiple zero of z*n(z). Let and satisfy: 1° —r<pBi< B< po 
<r; 2° max — Bi)/(1 — fiz) — — Bot) < p(n); 


Bi<a< Ba 


3° (B — B:)/(1 — BB:) = (Bz — B)/(1 — BB2). If the alleged factor, 
((z — B)/(1 — Bz))*, of w*n(z) were replaced by (2 — — fiz) 
-(z— Bz) /(1— Bez), the resulting function would be such that its écart on 
the interval (—r=2Sr) would be less than p(n). This is certainly true 
for 8B: Sz Bz by the choice of 8, and Bz. On the other hand, for those z 
which satisfy either —r Sz < f; or B2 < Sr the readily verified inequality 


(7. 4) toa 


prevails. Hence the écart on the interval (—rS zr) of the function 
replacing 7*,(z) would be less than p(n), and this is contrary to the definition 
of p(n). 

If x*,(z) is subjected to the normalization, 7*,(1) =1, then z*n(z) is 
real for z real. Since *,(z) is of degree n, all the zeros of dx*,/dzin|z| <1 
are accounted for by the points of relative maxima of | rn | between successive 
zeros Of z*,, there being precisely one zero of dx*,/dz between successive zeros 
of The value of | 
minima as well as at z==-—r, ris Suppose, for example, that 
< p(n) (it should be observed that 7*n(7) > 0 by virtue of the normalization 
to which z*,(z) is subjected) and let the zeros of z*n(z) be denoted by 


at these positions of relative maxima and 


@1, %o,° * Where 


If a, were replaced by a’, in 7*,(z), where @’» is real, less than and sufficiently 
close to %, then the resulting function would be such that the maximum of 
its modulus on the interval (—r= zr) would be actually less than p(n) 
and this is impossible. A similar argument would apply if | 7*,.(—7r)| 
<p(n). There remains for consideyation the relative maximum of | 7*n(z)| 
between and (k& Let yx denote the zero of dr*n/dz 
lying between a and %,. (k =1,2,- -,n—1), and suppose that | +*n(yx) | 
< p(n) for some index ko. Replace and by ox, and respectively 
in r*,(z), where o;, and satisfy the following conditions: 1° << a%, 
3 2° Oy, so close to and is so close to that the 
maximum of the modulus of the replacing function for @;,, << @ < @”ky41 is less 
than p(n); 3° the non-Euclidean distance between ax, and a, is equal to the 
non-Euclidean distance between o., and @%.1. The effect of 3° is that 
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/ 


(7. 5) 0< 


for —1l<2r< oy, and @ << The function replacing 7*n(z) under 
this change of zeros would be of modulus less than p(n) for —rSe¢Sr, 
which is impossible. To sum up, 


THEOREM 7.1. There exists a (1,n) directly conformal transformation 
of |z| <1 onto itself such that the maximum of its modulus on the interval 


(—rSzsr) (0<r<1) is equal to p(n) =g.1.b.[ max | mn(z)|] where 
{an(z 


mn(z) denotes a (1, n) directly conformal map of | z| <1 onto itself. Further, 
if w*n(z) denotes an extremal function for this problem so normalized that 
n*,(1) =1, then r*n(z) has the following properties: 

1° x*,(z) is real for z real; 

2° the n zeros of r*n(z) are all simple and lie in (—r<a@r<r); 

3° =p(n), | r*n(—r)| p(n), the marimum of | 7*n(z)| 
between successive zeros of x*,(z) is equal to p(n), this value being attained 
by | r*,(x)| at r, -—r, and the zeros of dx*n/dz; 

4° the zeros of dx*,/dz are simple, and there is precisely one zero of 
dr*,/dz between successive zeros of r*n(z), and there are no others. 

It should be remarked that | 7*,(z)|—=(n) only at those points enu- 
merated in 3° since 7*n(z) is of degree n. The structure of the Riemannian 
image of the extended plane with respect to w = 7*,(z) is completely specified 
by Theorem 7.1. Since the derivative of 7*n(z) at z= 1 is positive, it follows 
from the uniqueness theorem of the theory of the conformal mapping of simply- 
connected Riemann surfaces that 7*n(z) is uniquely defined by its normaliza- 


tion and its extremal property. Hence 


THEOREM 7.2. The normalized extremal function of Theorem 7.1 1s 
unique and hence is odd if n is odd and even if n is even. All the extremal 
functions are given by e*r*,(z) (6 real). 

There remains to be considered the determination of p(n) and 7*,(z). 
They may be found by the following argument.” Consider the function (2) 
defined by 
(7.7) =n*,[B(z;7r)]. 


An analysis of @(z) reveals that ®(z) = B[z; p(n) ]. 


7It is readily seen that w(1) =r and *,(z) =z. It will be assumed for the 
remainder of this section that n > 1. 


1 is 


mal 
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This may be seen by first noting that B(0;r) =r implies ®(0) = p(n). 
Second, the structure of G[®] may be readily inferred from the structures 
of G[B(z;r)] and G[x*,]. From the definition of B(z;r) as ¢(—2;r) 
(5), G[B(z;r)] may be constructed with the aid of a set of simple open 
disks, Sz, (4 =0,1,2,- - - ; —1,—2,- - -), where each S; consists of a copy 
of |w| <1 slit along the real axis from —1 to —r and from r to 1. In 
order to obtain G[B(z;1r)] the upper and lower banks of the right slit of S 
are joined to the lower and upper banks respectively of the right slit of Sx.1, 
and the upper and lower banks of the left slit of S; are joined to the lower 
and upper banks respectively of the left slit of Sy, for k—=0,+2,+ 4, 
‘++, the Riemannian domain so obtained being G[B(z;r)]. From the 
descriptive properties of w*,(z) developed in the present section G[x*n] 
may also be constructed. Let o, (A=1,--~+,n) denote a copy of the open 
disk, | w | <1, slit along the real axis from —1 to —p(n) and from p(n) 
to 1. Then G@[x*,] may be formed from the oy as follows. The upper and 
lower right banks of o; are joined to each other; then the upper and lower 
left banks of o, are joined to the lower and upper banks respectively of o>. 
If n= 2, the upper and lower right banks of o2 are joined to each other. 
If n > 2, then the upper and lower right banks of o2 are joined to the lower 
and upper banks respectively of os, etc. In on one slit remains and the upper 
and lower banks of this slit are to be joined together. If m is odd, it is the 
left-hand slit, otherwise the right-hand slit. The validity of this construction 
is guaranteed by the general properties of (1,) directly conformal maps of 
the interior of the unit circle onto itself and the particular ramification proper- 
ties of 7*,(z) already enumerated. 

It follows from this discussion that the Riemannian image of each S; with 
respect to x*,(z) consists of a copy of G[x*,] with o; slit from p(n) to 1 
and o, slit from »(7) to 1 if n is even, or from —1 to — p(n) if n is odd. 
Hence if G®[x*,] denotes the Riemannian image of S; with respect to 
n*,(z), G[®] may be constructed in terms of the G™[x*,] by connecting the 
latter along their slits in the following manner. Let it be agreed that the 
right slit of o,°°) of having the connotation for [x*,] 
that o, has for G[x*,]) is to be connected to the right slit of o,“) and that 
the as yet unconnected slit of on‘ be connected to the corresponding slit of 
on), upper and lower banks being connected with lower and upper banks 
respectively. The next step is to connect the available slit of on" to the corre- 
sponding slit of o,‘°) and the right slit of 0,“ to the right slit of o,‘*), again 
with the same type of connection between upper and lower banks. The process 
is to be continued in this way step-by-step. In this manner it is seen that 
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G[®] and G[B(z;p(n))] are identical. Thus, since (0) = B(0;,(n)), 
= B(ez;u(n)) real). Now (mod z) ; this may be established 
by noticing that the zeros of ®(z) are to be found from those of B(z;p(n)) 
by multiplying those of the latter by e~*’; all the zeros of ®(z) and B(z; n(n) ) 
being real, the assertion follows. Since B(z;u(n)) is even, ®(z) =B(z;p(n)). 

The following argument determines »(m) and the zeros of r*,(z). Let 
T, denote the linear fractional transformation of | z| <1 onto itself asso- 
ciated with B(z;r) as T is associated with B(z;m) in 5 and let A, denote 
the associated multiplier (cf. (5.2)); similarly let Tun) and Aycny have the 
corresponding connotations for B(z;(n)). From (7.7) it follows that the 
cyclic group generated by 7, is a subgroup of the cyclic group generated by 
Tun). Hence for some positive integer Actually k =n. 
This may be seen by noting that B(z;(n)) has precisely & simple zeros in 
the interval (0<@2< T*yn)0) or, what is the same, in the interval 
(0<2<T,0). Now B(x;r) is monotone decreasing for (0S 2=T,0) 
and B(0;r) =r and B(T,0;r) =—r. Since the n zeros of w*,(z) are 
located in the open interval (—r,r), x*,[B(x;r)]=B(z;p(n)) has n 
simple zeros in (0<4< 7,0). Hence kn. A consequence of this result 
is that = On reference to (5.6), the value may be obtained 


at once. It is 


00 1 — A, (2k+1) 2 
(7. 8) p(n) If [| of | 


Also the Blaschke product for B(z;(n)) may be written down at once with 
the aid of (5.4) and (5.5). 

The extremal function, 7*,(z), will be “specified ” once its zeros are 
given. These may be found from the identity already employed: 


(7. 9) B(z;p(n)) 
On the interval (0< 7,0), B(z;pu(n)) has n simple zeros. In fact, 
they are =1,2,---,n). Since B(z;r) is (1,1) on the 


interval (0S 2ST,0), the points B[T'*/"0;r] (k=1,2,---,n) are § 
distinct, lie in the interval (—r<a2<r) and hence by (7.9) are the zeros § 
of x*,(z) and there are no others. The argument is complete. 


Remark I. This discussion may be paraphrased for the theory of ordinary 
Tchebycheff polynomials with the starting point of the discussion being the 
definition of the Tchebycheff polynomial as one which enjoys the extremal 
property usually associated with it rather than the ad hoc definition 


T n(x) = cos(n are cos 
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Remark II. It should be noted that the class of competing functions 
{x*,(z)} is not convex. Hence the useful property of convexity available in 
the theory of ordinary Tchebycheff polynomials may not be employed here. 
The full force of the proof is to be found in the representation (7.2) of a 
competing function, in the associated discussion of the location of the zeros 
of an alleged extremal function, and in the ramification properties of the 
mapping it defines. 


8. An extremal problem for functions analytic and of modulus less 
than one in the interior of the unit circle, which vanish at the origin and 
have an assigned derivative there. The second extremal problem alluded 
to in 7 is: 


PROBLEM. Let Hq denote the class of functions, f(z), which are analytic 
and of modulus less than one for |z| <1 and satisfy the conditions: 
1° f(0) =0, 2° f’(0) =a, where « is an assigned real positive number less 
than one. Further let p(a) f(z)|]. Lo determine p(a) 

feEa -1<@<1 


and the corresponding extremal function.® 

This problem may be solved in terms of the function B(z;m) by a pr- 
choice of m. Since the solution of the present problem calls for many details, 
the major steps are outlined. 


I. There exists an extremal function for this problem. This may be 
inferred from the compactness of the class Hg. Let $(z) denote a typical 
extremal function. Actually it will turn out that there is a unique extremal 
function. 

II. | ¢(z)| attains p(a) for some z in the open interval of the real azis 
(—1<72<1). 

III. In fact, | p(z)| attains w(«) an infinite number of times on the 
interval (—1<4<1). 

IV. The extremal function is unique. From this stage of the solution on, 
¢(z) is unambiguous in connotation. 

V. (2) is real for z real and ts odd. 

VI. All the zeros of $(z) are real. 

VII. At this juncture it is convenient to introduce an auxiliary extremal 
problem. Let (a, 7) = 1. b. [ max | f(z)|], and let it be required to 

eLa 


determine properties of the associated extremal function. (The problem of 
determining »(#,r) and the corresponding extremal function explicitly is 


® Cf. Bernstein [1] where a related problem for entire functions is treated. 
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difficult but its solution is not essential for the solution of the major problem 
at hand.) The following properties of the extremal function for this sub- 
sidiary problem are listed without proof.° (a) From the compactness of Ea, 
it follows that there exists an extremal function. (b) The extremal function 
is unique. It defines a (1,k(r)) (k(r) a positive integer) directly conformal 
map of |z| <1 onto itself. Let it be denoted by w(z;7r). (c) (231) ts real 
for z real and is odd. (d) The zeros of w(z;1r) are real and simple. (e) At 
most one zero of w(z;r) lies to the right of x =r and at most one zero of 
w(z;7r) to the left of r——r. (f) p(a,1r) is attained on the real axis by 
|w(z;r)| between successive zeros of ¥(z;r). (g) The antecedent of the 
closed interval with respect to w= y(z;r) is a subset 
of (—1<24<1). (h) is monotone non-decreasing for 0 <r<l 
and lim (a, r) =p(a). Hence by IV limy(z;r) =¢(z) in the sense of 
rl 


continuous convergence for |z| <1. 

VIII. From VII (g) and (h) it may be proved that the antecedent of 
the closed interval [—p(a),p(a) | with respect to w= (2) is 1). 

IX. At each point of (—1<x<1) where |¢|—p(a), ¢’ has a zero 
of order one. 

X. From IX it is concluded that ¢(z) = B[T+4z;p(a)] where T has 
the meaning previously associated with tt. 

XI. w(a) is determined in terms of « from the relation $’(0) =a. 


The proof of I will be omitted inasmuch as the argument is standard. 
To establish II the auxiliary function, »(z; 8), is introduced, being defined by 


(8.1) n(2; B) = ze8 (+3) /(2*1) (8 real and positive). 


With this restriction on 8, 7(z;) is analytic and of modulus less than one 
for | z| <1 and in addition has the property that lim n(z; 8) = lim n(2; £) 


=), where x is assumed real and of modulus less than one. If B is taken 
equal to —log a, then Let it be assumed for the immediate 
argument that this is the case. The statement II may be established as follows. 
Suppose that »(«) is not attained by | ¢(z)| on the open interval (—1 <2 
<1) and let r be chosen positive and less than, but so close to, one that for 


az real and satisfying r= <1 


(8. 2) | n(x: B)| Sm(«)/2. 


® The proofs of these properties may be supplied by applying the methods of §7 
of [5]. Since the proofs of the present results are so closely related to those given 


there, they will be omitted. 


f §7 


riven 
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Clearly max | p(«). Now consider 
(8. 3) = (1—A)d(z) + An(z; B) 


for A real and satisfying 0=2A=1. For such dA the function (8.3) belongs 
to Ford positive and sufficiently small, ®(z; A) satisfies (i) max | 


<p(a); (ii) Lub. | (a real). Hence Lub. | A)| 


-1<-2<1 
< p»(«) and this contradicts the assumption that $(z) is extremal. II follows. 
The assertions III and IV may be established by a combined argument. 


Let {xx} denote the set of points on (— 1 < # < 1) for which | $(2x)| = p() 
and consider the Pick-Nevanlinna interpolation problem ?° of determining the 
functions, g(z), which are analytic and of modulus less than one for | z| <1 
and satisfy in addition 


(8. 4) g(%) =¢(ax), g(0)=0 


for all indices & considered. The following inferences may be drawn from 
the Pick-Nevanlinna interpolation theory. The case where the set {2x} is 
finite will be treated first. If the function g(z) is uniquely determined by 
the interpolation conditions (8.4), then it is rational and defines a (1, K) 
directly conformal map of | z| <1 onto itself for some positive integer K. 
This is impossible because then ¢(z) would have the property 
(8. 5) lim | ¢(z)| =1 

which is contrary to its definition as an extremal function for the major 
problem under consideration. 

If g(z) is not uniquely determined by (8.4), then the possible values 
of g’(0) for the class of g(z) considered fill a closed circle containing «. Let 
p (<1) denote the largest positive number in this circle. Clearly, «Sp. 
There is a unique member of the class of g(z), say go(z), which satisfies the 
further condition that g’9(0) =p; go(z) defines a (1, K) directly conformal 
map of |z|<1 onto itself. The assumption, «—p, is untenable by the 
argument used above. If « < p, take B in y(z;8) equal to —log a/p. Then 


- the function 


(8. 6) = go(z)n(z; B)/z 
belongs to #, and is such that for the 2; considered 
(8. 7) | (xx) | < p(a). 


19 Cf. Walsh [15], R. Nevanlinna [13]. 
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Therefore for sufficiently small and positive, (1 —A)¢(z) + Ad¢*(z) belongs 
to and satisfies 

which contradicts the extremal character of ¢(z). Therefore an extremal, 
$(z), satisfies III. 

The situation where the set {2} is infinite remains to be considered. 
Again, if g(z) is not uniquely determined by (8.4), the set of possible values 
of g’(0) fills a closed circle, containing a. Let p* (<1) denote the largest 
positive number in this circle. The possibility that « < p*, is excluded by the 
argument applied to the case where the set {2%} is finite. Hence « = p*, and 
this implies, by the Pick-Nevanlinna theory, that g(z) is uniquely determined 


by (8.4) and the condition g’(0) «a. There remains only the case where f 


g(z) is uniquely determined. At all events the only function analytic and 
of modulus less than one for | z| <1 which satisfies (8.4) and g’(0) =a 
is itself. 

Now suppose that #(z) is not unique and let ¢,(z) and ¢2(z) denote two 
distinct extremal functions of class Hq. Clearly, [¢:(z) + ¢2(z)]/2 also 
belongs to H,, is extremal, and is distinct from ¢; and dz. Let {X;.} denote 
the infinite set of points on (—1 <2 <1) for which | ¢, + ¢2 |/2 =p (a). 
At such points $; = (4: + ¢2)/2 =—¢2. Hence by the result of the pre- 
ceding paragraph 


which is impossible ; uniqueness is established. 


Assertion V is readily proved by noting that along with ¢(z) both § 


and —¢(—2) are extremal. 


Assertion VI may be concluded in the same manner as VII (d). Thef 


proof is readily supplied. 


Assertion VIII may be established as follows. Suppose that there exists a 
point z of | z| <1, not on the interval (—1<2< 1), with ¢(z) in thef 
,1— | Zo |] a point € such that #(€) would lie in the closed 
interval [— »(«,1r), 7) ] for all less than and sufficiently near one. But 


then by Hurwitz’s theorem, for a given neighborhood of {, say N(¢), and for§ 


closed interval [—p(@),p(a)]. Then there would exist in | z—2%| 


< min [! Iz 


r sufficiently near one, v¥(z;7) would attain the value ¢(f) for some z in 
N(f) and this manifestly contradicts VIT (g). 


As for IX, the points of (— 1 < x <1) where | ¢(x)| —p(a) are pointsf 


an 


pr 


| 
( 


ngs 


both 


The 


ists a 
n the 
— 
losed 

But 
for 
2 iD 


points 


co 
| Hence a [7*0]*?. But (7*0)? = ( 
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where (2) has either a relative maximum or minimum, ¢(x) being con- 
sidered as a real function of the real variable z At such points ¢’ has a zero 
of order one as a consequence of VIII. 

Step X is decisive in completing the proof. Let b(z) denote B( Tz; (a) ) 
and consider the equation 


(8. 10) =0). 


Since 6(0) = 0, it follows that (8.10) defines y(z) as a single-valued analytic 
function of modulus less than one for |z| <1, since G[¢] is completely 
ramified with branch points of order one over p(a) and —yp(a). Upon 
differentiation, (8.10) yields 0’(0)y’/(0) =¢’(0). By Schwarz’s lemma 
|y(0)| S 1. Hence 0 < « = ¢/(0) S 0’(0). It is impossible that 
a< b’(0), since then («/b’(0))b(z) would belong to Hq and yet for real z, 
| (%/b’(0) )b(z)| S (a/0°(0)) This would imply that 4(z) 
is not extremal. The refore since « = b’(0), it follows that $(z) = b(z). 

There remains to be considered the determination of »(@) and the multi- 
plier of 7’ in terms of « (XI). These questions may be treated by much the 
same methods as those used in 5. As in that section let T be given by 


| z—l1 


Then b(z) may be represented by 
co (T*0) 42 
| (8.12) b(z) == 211 i— 


e—fi 
It is readily seen as in 5 that there exists a unique A (0 <<A< 1) satisfying 


(8.13). With A so determined, 4(z) is given by (8.12). The extremal value, 
u(«), is b(T40). The solution of the problem is complete. 


). It follows that 


Remark. Related extremal problems may be treated in the same spirit. 
For example, one may consider the problem of determining 


[lub. | 


Ea 


and the associated extremal function. The problem treated in this section 
and the one just alluded to are of interest inasmuch as their solutions yield 


precise information on the magnitude of a function of class HZ, either on a 


nal, 

red, 
lues 
cest 
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é 
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also 
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diameter of the unit circle or a ray issuing from the origin. Thus an easy 
corollary of the present discussion is 


Corottary. If f(z) is analytic and of modulus less than one for | z| <1, 
and if further f(0) =0, | f/(0)| =« (0<«< 1), then along each diameter 
of |z| <1 there exists a point at which |f| 2p(a). If along any diameter 
|S then f(z) (et%z), where 6, and 62 are real constants. 
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THE DENSITIES OF IDEAL CLASSES AND THE EXISTENCE OF 
UNITIES IN ALGEBRAIC NUMBER FIELDS.* 


By AuREL WINTNER. 


According to a terminology customary in the analytic theory of numbers, 
the “depth” of an asymptotic law concerning integers or ideals is measured 
in terms of the (specific) arithmetical information needed in the proof. 
Correspondingly, the varying degree of complication in the (general) ana- 
jutical theorems of the Tauberian machinery involved is not taken into 
xccount by this principle of classification. In this technical sense, the prime 
number theorem is deeper than Mertens’ asymptotic formula, not because a 
Tauberian theorem relevant for the former requires a proof which happens 
to be longer than the proof of a Tauberian theorem sufficient for the latter, 
but merely because the former does, and the latter does not, depend on a cer- 
tain arithmetical information; namely, on one involving the location of the 
zeros of f(s). 

For the “elementary” laws of algebraic number fields, it is natural to 
consider a corresponding principle of classification, by isolating the theorems 
which ean be proved without involving the arithmetical fact represented by 
the existence of a complete system of unities or of an associated lattice. In 
particular, the Dirichlet-Dedekind theorem, concerning the ergodic distri- 
bution of the integral ideals over the various ideal classes of the field, appears 
to be less “deep” (that is to say less arithmetical) than an asymptotic esti- 
mate involving such results as Minkowski’s theorems on discriminants. How- 
ever, it is another matter to prove that such is the case. In fact, the classical 
proof of the law of Dirichlet-Dedekind depends on the arithmetical theorem 
concerning the existence of a lattice. 

Thus it might be of interest for algebraists that they are overestimating 
precisely the “algebraical depth” of the Dirichlet-Dedekind theorem, since 
the theorem on the existence of unities (lattices) is superfluous in the proof 
jor the existence and the equality of the densities of the various ideal classes 
in an arbitrary (finite) algebraic number field. 

The resulting proof is not recommended for class-room purposes. It cer- 
tainly is not simpler than the classical proof. However, it would be about as 
ineaningless to state that the latter is simpler than the former. In fact, one 
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could say that the classical proof is “simpler” only at the cost of introducing 
the deep, that is to say arithmetical, circumstance that there exists in the field 
a complete system of unities, transforming the problem into what Minkowski 
calls a macroscopic volume-evaluation. But it turns out that, historically, 
this rdle of the theory of the unities was that of a deus ex machina (even 
though, according to Furtwiingler, the theory of algebraic numbers can to-day 
be developed so as to be based on a lattice). 

it seems to be of interest that the legitimacy of this methodical point of 
view is fully supported by the historical development. In fact, when Dirichlet 
introduced unities into his (quadratic) fields, his sole purpose was the proof 
of the class-number formulae. But the latter are identical with what results 
if the Dirichlet-Dedekind relations of an arbitrary field are particularized to 
the case of a quadratic field. In Kummer’s adaptation of Dirichlet’s theory to 
cyclotomic fields, the réle of the unities appeared in a form less pure, since it 
was interwoven with the algebraic affairs of Fermat’s problem. However, in 
Dedekind’s extension of Dirichlet’s work to arbitrary fields, the introduction 
of the unities served again no useful purpose distinct from the one for which 
Dirichlet originally devised it in his particular case. For the modern algebraist 
who cares for generalized theories of unities for the unities’ sake, this may 
not be easy to believe, but he can convince himself by turning over Dedekind’s 
supplements [1] or Minkowski’s historical sketch [3]. 

‘The ironical twist of historical development, which first makes mathe- 
matics devise a theory for a definite purpose and subsequently leads to other 
concrete ends (this time to Minkowski’s geometry of numbers) but ultimately 
proves that the introduction of the theory was superfluous exactly for the 
original purpose, is not of course an uncommon occurrence. That it presents 
itself in case of the Dirichlet-Dedekind theorem, can be proved as follows: 


With reference to any ideal class C of a fixed algebraic number field 
K = K(a), let f(n;C) denote the number of the integral ideals which are 
contained in C and have a norm not exceeding n, where n = 1,2,---. Then 
the assertion to be proved is that the ratio f(n;C)/n tends to a finite, non- 
vanishing limit, say A; and that, in addition, this limit A Ax is independent 
of the choice of C in K. 

Let F(n;C) denote the number (= 0) of the representations of n as 
norms of (integral) ideals contained in C. Then it is obvious that 


(1) f(n;C) = F(1;C) + P(2;C) +: F(n;C) 


and that, if £(s;C) denotes the zeta-function of C, the series 


THE DENSITIES OF IDEAL CLASSES. 
oOo 

(2) {(s;C) F(n;C)/n 
n=1 


converges in the half-plane o >1, where s=o-+it. It follows that the 
existence of a positive value A = Ag satisfying 


as o>1+0 


is a necessary condition for the truth of the assertion. In fact, Dirichlet’s 
Abelian lemma states that, if a,@2,°** is any sequence of values possessing 
a finite mean-value 
(3) M(a) = lim (a4, + an)/n, 


then the Dirichlet series 
oO 


(4) dn/ns 


n=1 


must converge for o > 1 to a function satisfying 


(o—1) as 


n=1 


or 


On the other hand, the expression on the left of the limit relation (5) 
can tend to a limit and, what is more, the function represented by (4) in the 
half-plane « > 1 can have a simple pole at s = 1, even though the limit ¢3) 
fails to exist. This is shown by a well-known example of Dedekind, redis- 
covered by Hardy, and subsequently used by Ramanujan for a mistaken proof 
of the prime number theorem in a manner the illegitimacy of which was 
verified by Dedekind precisely on this example; cf. [5]. It is true that the 
coefficients F'(n;C) of (2) are non-negative, but this is of no avail, since the 
restriction dm = 0 is satisfied by Dedekind’s’ counter-example also. 

However, as a corollary of Hecke’s functional equation, the function 
(s—1)€(s;C) is regular in the whole plane, and the residue of {£(s;C) 
at the point s = 1 has a positive value, say A Ax, which is independent of 
the choice of C in K. This information is available for the purpose at hand. 
In fact, while Hecke’s original proof did make use of the existence of a lattice, 
he subsequently observed that his functional equation can be proved without 
any reference to the theory of unities, and such proofs are now available in 
the literature. For older references cf. [4]. 

Actually, neither the functional equation nor the regularity of the 
difference 


(6) —A/(s—1) 
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in the whole plane will be needed in the sequel. All that will be used is that 
the function (6), where o > 1, goes over into a continuous boundary function 
on the line o—1. 

In fact, this in itself makes applicable Ikehara’s theorem, which is a 
particular case of a purely analytical result concerning Laplace transforms 
and does not presuppose any arithmetical information; cf., e. g., [2]. Ikehara’s 
theorem states that, if a,,a2,--- is a sequence of real, non-negative values, 
and if the corresponding Dirichlet series (4) converges in the half-plane 
o > 1 to a function corresponding to which there exists a constant A such 
that the difference 


co 
(6 bis) An/n* —Ar/(s—1), 

n=1 
where o > 1, goes over into a continuous boundary function on the line o=1, 
then 


as n—> oo. Hence, if (4) is identified with (2), it follows from (1) that 
f(n;C) ~An. 
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SOME RELATIONS BETWEEN THE BEHAVIOR OF A FUNCTION 
AND THE ABSOLUTE SUMMABILITY OF ITS FOURIER SERIES.* 


is a 


TMs By KiEn-Kwone CHEN. 


ues, 


lane 1, We suppose that f(t) is a periodic function, with period 2z, integrable 


such 


in the Lebesgue sense. We write 


o(t) = 3{f(@ +t) + 


and suppose the Fourier series of @(¢) to be 3Ancosnt. Bosanquet? has 


shown that the convergence of % | An | does not imply that the mean-function 


is of bounded variation in (0,7). It has, however,? been proved that the 


that 


later property on [¢(¢)], is a consequence of 


| An | logn < ©. 


In the following section we prove that the convergence of & | dn | implies that 


Jf, | (Clog dt < (0<8<1). 


This is the case p(t) =| logt |-*, A =8 of Theorem 1. The chief object of 
the present paper is to establish Theorem 2 which includes both the theorems 


stated above. 


itzen 


seal 2 THEOREM 1. Jf the Fourier series of f(t) converges absolutely at 


t= then the function 


x. 


yncil, 


ts of bounded variation in (0,A), where p(t) denotes any function which is 


ula,” 


768. absolutely continuous in (0,A) such that 


A 
("| tp(t)| dt < @. 
70 


* Received May 1, 1944. 
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As stated above, this proposition is involved in Theorem 2, but there is a 
direct proof much simpler than the proof of Theorem 2. 
In fact, if 0< t< A then 


sin nt d sin nt 
1 t = n = 
Since 
A P sin nt A 
and | 
d sin nt A 
tts -1 


it follows from equation (1) that 


A A +A 
0 0 0 
This proves the theorem. 


3. Write now o,”% for the n-th Cesaro mean of order « of the series SAn. 
The series SA is said to be super-absolutely summable (C,a) to the sum S 
with respect to the sequence 1 = {In} (0 < In < Inii) or summable /| C, a 
to S, if o,*—> S and the series 


| On* — On-1% | 


is convergent. We denote this mode of summability by the equation 


(2) YAn = S(1| C,a|). 


We also write, for an integrable function (uw), 


[x(t) la = —u)*1y(u)du (a > 0) 


as the mean function of order « of x(w) on (0,¢). 

The theorem on the summability {logn}|C,0| stated in 1 has been 
extended as follows: * if (2) holds true for a sequence / = {(logn)?} (p=1) §& 
and an order « >—1, then the function | is of 
bounded variation in (0,7), provided that A<1, AS=1— (a), where 
denotes « — [a], the fractional part of «. The proof depends upon a number 
of lemmas, some of which are recalled in the next section as they are relevant 
to the investigations of the present paper. 


*K. K. Chen (1). 


4 


THE ABSOLUTE SUMMABILITY OF FOURIER SERIES. 241 


4. Lemma l. Let and 
write 


then the series 
oo co 
(—2 — n-pC (nt) 


n= 


converges to a function Fy(t) such that 
Fy(t) = O(vt)* (p: a@ positive constant), tF’y(t) =O(1). 
Moreover, 1f vt = 1, then 
Fy(t) = B(1— A, 1+ 4) + O(t)™, F’,(t) = O(v). 


LEMMA 2. Under the conditions of Lemma 1, we have 


A/e 


ra 
J u) du = (av* — ov-1*) = 0), 
0 v=0 
provided that & | on*-—on-1* | converges. 


5. The method we have chosen for the purpose of establishing the 
theorem stated in 8 may be generalized. Indeed, we can prove the following 


theorem. 
THEOREM 2. Let p(t) be a monotone function of t in (0,7), satisfying 


the conditions 


(3) O0<p(t)=O(|logt|4), p(t) =o(L(t)), 


T 
as t—>0, where A is a constant, and L(t) = f p(uywrdu. If the Fourter 
series of f(t), at the point t=, is summable 1| C,a| for an order a > —1 


lo the sum 0, where 1, = L(1/n), (n—1,2,3,-- -), then the function 
 (A<1, AS1—(a)) 
is of bounded variation in (0,7). 
Before proving this theorem it is convenient to give first the proof of the 


following lemma. 


6. Lemma 3. If the derivative p’(t) of p(t) exists and is continuous 
for0 <trz, then the conditions (3) involve 


/ 
ig 
Ss § 
een 
1) § 
of 
(a) 
ant 
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t 
(4) (1/t) p(ujdu—o(L(t)), (#90). 
Firstly, we prove that the relation 


(5) (wywau = 


holds true for every positive p, as t—> 0. 
In fact, in virtue of p(t) = 0(1), integration by parts gives 


(6) = p(t) i? — few) pu?*du. 


Using the relation p(t) = o(L(t)), we have 
t t t 

(7) f o(u) -purtdu—o( L(u) pu-tdu) = +f u?p(u) du), 
0 0 0 


in view of the fact that L(t) = =o(1). 
t 


Since p(w) > 0, it follows from (7) that 


pu’ *du = o(L(t)t?). 


On combining this relation with (6), we obtain (5), by using p(t) = o(L/(t)) 
again. 
On integration by parts, we obtain 


t t 
(1/t) p(t) — (1/t) up’ (u)du = 0( L(t), 
by appealing to (5) and (3). This establishes the lemma. 


7. Lemma 4. If the Fourter series of f(t), at t= a2, is summable 
1|C,a| (a >—1) to the sum 0, then there is a sequence {tn} such that 


0< tn—> 0, lim W(t) = 0, 


where 1 = {ln}, In = L(1/n), L(t) = f and the functions p(t) 
t 


and W(t) satisfy the conditions of Theorem 2. 
In fact, by hypothesis, the positive function 


= p(t)/L(t) 
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is 0(1), as £0. We shall use the function 
(8) p = p(t) = max + 1/2e(#) log t), 


In virtue of the monotony of p(t), the derivative p’(¢) has a definite sign 
for small ¢. If p’(¢) = 0, we have 


L(t?) /p(t) =1/e(t) — (A/o(t)) (o(u) /u)aw 
> 1/e(t) —'1/p(t) « p(t) log = 1/2«(t), 


by (8). Or, if ¢ is small enough, 
p(t)/L(#) < 2(t). 
This result is also true, when p’(t) > 0. Indeed, if Z(0) is a convergent 
integral, then it follows from 
lim L(t)/L(#) =1 
t0 
that 
(9) p(t)/L(t) = e(t)L(t)/L(t) < 2e(¢), 
for small ¢. On the contrary, if L(0) = «, we have 
1< lim sup L(t)/L(t) =lim sup [dL (t) /dt]/[dL(t#) /dt] 
t0 t0 
= lim sup p(t)/pp(t?) S1/p S 4/3. 


From this (9) follows, if ¢ is sufficiently small. 
Since the summability 1|C,«| of %An implies the convergence of 
% | on*—on.+* |, we have by Lemma 2, 


= W(t) /p(t) = Jara 
=(2 = ) (on* — on-1*) Fn(t) = + 


By Lemma 1, we have F(t) == O(1). Hence 


6,(t) =0( > on*— on-1? |) = O([1/L(t)] ln | on* — on-1% |). 
It follows that 
(10) p(t)®.(t) =o(1), 


as 0, since = o(L(t)) and SA, is summable | C,¢ |. 
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Writing [¢*] = T, we have to consider ®,(¢) wherein 
F,(t) = B(1— A, 1+ a) + O(vt)™, 


by Lemma 1. Consequently. we may write 


(11) ,(¢) =(#+1)B(1—A, 14+ 2) + (on* — on-1*) O(nt)***, 


temembering that = 0(1), we have 


= (ona*— on"). 
n=T+1 


Accordingly, | p(t)or* | S(t) In| on*—on-.*|. Hence we obtain 
n>T 


(12) p(t)or* = 0(1). 
In virtue of the relations «+1>0, Ty1, we have, on writing 


M = M(t)[t7], 


M T 
(13) >. | On* — On-1° | (at}**= | -— On-1% | + DS | ont*—o.! 


n=0 n=M+1 
It follows from (8) that 
t(t-p) (Ata) exp(— (1 + a) /2e(t))}, 


so that 


= /4) + O(exp(— (1 + @)/2e(t) )). 


On differentiating the equation p(t)/e(t) = L(t) with respect to ¢. and 
multiplying by e«(¢)/p(¢), and then integrating the equality obtained from 


t to 1, we get 
1 
p(t) /e(t) =C exp( (<(w) /u)du 
t 
with C—L(1). Take a positive number K small enough such that 
(15) Kp(t) = | logt |A, 
anl set G=A-+41. If there exists a positive number ¢, such that 


e(t) log 1 /t=@G 


nd 


om 
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for 0 << tt, then we have 
p(t) /e(t) = Cexp( f“(e(w) /u)du 
= C exp(G (log t/log to) = C'(log t/log to) 
On combining this result with (15), we obtain 
(16) 1/e(t) = (KC/| log ty |%) |logt|  (0<t Sh). 


Writing 


y =min((1+«)/4, (1+ 4) KC/2 | log 
it follows from (14) and (16) that 


(17) p(t) Ts(t) = O(p(t)) + O(exp(—y! log |)) 
= 0(t7 | logt |4) =o0(1). 


On the contrary, if there exists no t) such as assumed above, then there is a 
sequence of positive numbers ty, ty > 0, satisfying 


e(ty) log 1/ly < G; 
and (14) gives 


(18) p(tv) (tv) = + O(p(ty) )exp(— (1 + a) /2G log 1/tv) 
= | log ty |4) = 0(1). 


By (17), the relation (18) holds good for any case. 
Finally, it follows from 


\ N 
T2(t) | On* — On-1° | = (1/L(#)) ln | — On-1° 
M+1 


M+1 
and p(t) < 2L,(t?)e(¢) that 
(19) p(t) < 2e(t) bs | On* — On-1% | o(1). 


Combining the results (10), (11), (12), (13), (18) and (19), we arrive 
at the relation 


(ty) (ty) = p(tr) Pi (tr) -+ p (tr) ®2( ty) = 0(1). 


This proves the lemma. 


8. We can now prove Theorem 2. It follows from Lemmas 1 and 2 
that the relation 


2 | 
m | 
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co 
(20) = (1+ a) 2 — ona?) 
holds true for 0 < tz, since F’,(¢t) = O(t*) and the series 


> | on* — | 
is convergent. 
Using Lemma 1, we have 


— 


The second term is 7, And the first term is by Lemma 3, o(Jn) as n— 0. 
Hence 


LY’ 
(21) Jf | Pra(t)| dt ~ by 
By (20) and (21), there is a constant K satisfying 
0 n=0 


Now, let t, be the numbers given by Lemma 4; then (tn) —o0(1). 
Integration by parts gives 


(8) | dt | a 
= p(x) | f “o(t) (dat) | dt. 
In virtue of the monotony of p(t), the above relation implies that 
Hence, by (22), 
(23) p’(t)®(t)| dt = p(x) | + K Bln | on*—on1° |. 
0 


We have W(t) p(t)®(t) + p’(t)®(t). Hence, by (22) and (23), 
we obtain 


W (t)| dt S p(x) | + 2K | on? —on-1* |. 


This establishes Theorem 2. 
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9. In his paper “ The absolute Ceséro summability of a Fourier series,” 4 
Bosanquet asserts nothing concerning the behavior of the function 


when the Fourier series An is summable |C,a| (#20). It should be 
observed that the summability | C,« |, is equivalent to the summability 


{L(1/n)}jC,a|  (#>—1) 


in the case where L(¢) converges to a finite number, as ¢—>0. Accordingly 
we can infer from Theorem 2 the following proposition. 


THEOREM 3. If the Fourier series of f(t), at t =<, is summable | C, « | 
(a > —1), then the function 


is of bounded variation in (0,7), where p(t) is any function which is abso- 
lutely continuous in (0,2) such that 


f, at < @, 


and where 2$(t) denotes f(x+t) + 
In fact, the relation (21) is now reduced to 


J, at 0(2). 
Consequently, (22) becomes 


T 
(24) dt SK | 
0 
We next have 


fle ®Fa(t)| t= 
by the relations = O(1) and p' (t)| dt << co. It follows that 
(25) dS KS | —ons* |, 
K’ being a constant. 


*L. S. Bosanquet (2). 
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On combining the results (24) and (25), we obtain 


The theorem is thus proved. 
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A GENERALIZATION OF HARDY’S THEOREM WITH AN 
APPLICATION TO THE ABSOLUTE SUMMABILITY 
OF FOURIER SERIES.* 


By Kirn-Kwone CHEN. 


1. Suppose that f(x) is integrable in the Lebesgue sense on any finite 
interval, and that f(x + 2r) =f(«#). Then [¢(¢) Ja, the mean function of f, 
at the point x, of order a, may be defined by the equation 


ats 


where a > 0, 26(t) =f(x-+t) +f(x—t). If [6(t)], is of bounded varia- 
tion in (0,7), then z is said to be a point of de la Vallée-Poussin for f. At 
such a point, the Fourier series of f is summable |C,a| for every a > 1.7 
The point z is said to be a Dini point of f, if t*@(¢t) is integrable in the 
Lebesgue sense on (0,7). 

Hardy * has proved that a Dini point is a point of de la Vallée-Poussin. 
Evidently, the integrability of ¢*¢(¢) depends only on the behavior of the 
function f in the neighborhood of the point z. Bosanquet and Kestelman * 
have shown that the summability | C,1 for a Fourier series at a given point 
is not a local property of the function under consideration. Therefore, at a 
Dini point of f, the Fourier series is not necessarily summable | C,1 |. 

Theorem 1 and Theorem 4 of this paper are generalizations of Hardy’s 
theorem. From these propositions, we derive Theorem 2 and Theorem 3. 
Direct proofs of the latter theorems are also given. 


2. We begin with the following 


LemMA 1. /f I(t) L(0, 7), then 


et u 
h(t) vl(v)dvdue L(0, 7). 
0 


To prove this proposition, we may assume that /(¢) is non-negative. Then 


we have 
* Received August 17, 1943. 2G. H. Hardy 1. 
*L. S. Bosanquet 1. °L. S. Bosanquet and H. Kestelman 1. 
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t t t 
h(t) = (1/#*) wl(w)dw— w*l(w)dw S (1/t?) wl(w)dw 
0 0 0 
and the result is a consequence of the following lemma: 


Lemma 2. Jf l(t) then 


t 
ul(u)due L(0, 7). 
0 


For the proof, we can assume that l(t) 20. It follows from Hardy’s 
theorem that the function 


L(t) (1/t) fwl(u)au 


is of bounded variation in (0.7). This involves the existence of the limit 
L (+0). Integrating by parts, 


f "Uthat = iw + 


This establishes Lemma 2. 


8. If the Fourier series of f is absolutely summable C at the point 2, | 
then t-$(t) « L(0,2) for every X <1, but not forX—=1. The proof of this 
theorem will be given in another paper. 


Let us now assume that the integral 
x(t) = f 
+0 


exists in the Cauchy sense, i.e. the limit lim exists as a finite number, 


€—>+0 € 


Assume further that the function /(¢) = ¢"y(¢) is integrable in the Lebesgue F 

sense on (0,7). Then, the mean function [¢(t) ]. of the second order is of F 

bounded variation in (0,7). 
In fact we have 


[¢(t)]2= (2/t) f° 
= (2/t?) wx(u)du— (2/84) 


Integration by parts yields 


q 
4 


ly’s 


mit 


this 


iber, 


sgue 


is of 
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On differentiation, we obtain 
(d/dt) (1) (2/4)x(t) — (6/7) fx(u)du + (8/8) f° 


Hence, by Lemma 1 and Lemma 2, we have 


0 
The following theorem has thus been established : 


THEOREM 1. If t+y(t) L(0,7), then [¢(t)]. is of bounded variation 
in (0, 7). 


A combination of Theorem 1 and a Theorem of Bosanquet‘* gives 


THEOREM 2. If t*y(t)¢Z(0,2), then the Fourier series of f at the 
point x is summable | C,a| for every «> 2. 


4. Next, we shall give a direct proof of Theorem 2. This is based on 
the following two lemmas. 


LeMMA 3. Suppose thata<1,b>1andh(t) 20. Then the existence 
t 

of the integral f h(t)t°dt implies the convergence of the following two 
0 


serves 


co 1/n co 1 
t*h(t)dt and Sn (t)dt. 
1 0 e/ 1/n 

In fact, we have 


*1/n 
Un =n (t)dt = min{ (nt)~?, dt 
0 


0 
ty i h(t) dt min{ (nt)~, (nt)-?}h(t) de. 
Writing 
H(t) = ¢ min{ (nt), (nt)~} 
we have 


On account of the hypotheses h(t) =O and ¢h(t) « Z(0,1), the con- 
vergence of the two series St», and Sv, follows from the boundedness of H(t). 
We have 


*L. S. Bosanquet 1. 
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ntS1 nt>1 
2—a b 


This completes the proof of the lemma. 


Lemma 4. Write g*(n,t) for the n-th Cesdro mean of order « of 
sin nt (n=0,1,2,-- -); then fora 


(d/dt)*g*(n, t)| S Knd(1 + nt) 
where »—=min(a,1-+A), K being a constant. 
This is a known theorem.*® 


Let 2 < B < 3, and write r,° and o,® for the n-th Cesaro mean of order 
B of the sequences 


+a, +:--+an} and {nan} 


respectively, where a, (n =0,1,-- -) denote the Fourier coefficients of the 
function ¢(¢). If the series 


co 
(on? On-1°) 
n=0 


with o_,° = 0, converges absolutely to the sum S, then we write 


bay + —s | C,B |. 
An elementary calculation gives 
On? -— = 
We have to show that the convergence of 3n7?| r,°| follows from 


(t) € L(0, 
Since 


Nd» = (2/m) (d/dt)sin ntdt, 


we have 


— (2/n) (d/at) g°(n, t)dt. 


Now, on integration by parts, 


5.N. Obrechkoff 1. Cf. also L. S. Bosanquet 1, p. 519. 
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(d/dt) g8(n, t) dt 
= [ty(t) (d/dt) (d/dt) {t(d/dt) g8(n, t)fdt. 
This is equal to 


O(n) + +. nt)-2dt +n + nt)-Pdt, 


in view of Lemma 4. Hence there is a constant K such that 
1/n 
|S KX(1/n2) + kx f | x(t)| dt + | x(t)| de 
0 1/n 
1/n T 
4 Kxn f x(t)| dt + f | y(t)| de. 
0 1/n 


The convergence of the last four series is, by Lemma 3, a consequence of the 
integrability of ¢*|y(t)|. This establishes the relation 


+ = 81 |, 


for 2< 8< 3. The proof of Theorem 2 may be completed by the following 
theorem of Kogbetliantz.® 


Lemma 5. If >0, then 3a, —S|C,a| implies 3an—S|C,B| for 
B> «a. 


5. Suppose that /(¢) «L(0,7). Write 


tn-1 tn-a ty 


Lemma 1 can be generalized as follows: 


Lemma 6. The relation 1(t) «L(0,2) tmplies hn(t) L(0, 7), for every 
positive integer. 


In fact, integration by parts gives 
t 
hn (t) = — du. 
0 
To obtain the relations hn(t)«L(0,7) (n=1,2,: + -), we can assume that 


°K. Kogbetliantz 1. See also E. Kogbetliantz 2. 
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i(t) =0 so that An(t) S hn+s(t), for n= 2,3,---. Then, Lemma 2 com- 
pletes the proof. The arguments developed in the preceding article can be 
used to extend Theorem 2. 


THEOREM 3. If the repeated Cauchy integral 


exists as a function integrable in the Lebesgue sense on the interval (0, 7) 
then the Fourier series of f, at the point x, is absolutely summable 
(C,m+1-+.) for every positive «. 


In fact, let 8 > m+ 1; using Lemma 4, integration by parts gives 


(—1)™ f"Ut) t)at + O(1/n) 
0 k=0 


+ (1 + nt)-7O(nt)™ dt + O(1/n), 


where y= min(B-+m-+ 2). Hence, there is a constant K such that 


m+1 
k=0 


0 


1/n 
#2 | 41(t)| dt + Kn? | t1(t)| dt 
l/n 


+ Kn fm | dt + O(1/n?). 


From this, the convergence of Sn | r,° | follows, in view of Lemma 3. 
Theorem 3 can also be deduced from Bosanquet’s criterion. Indeed, we 
have the following theorem: 


THEOREM 4. If the repeated Cauchy integral I(t) in Theorem 3 exists 
as a function integrable in the Lebesgue sense on (0,7), then the mean func- 
tion [b(t) of order 1+ m is of bounded variation in (0,7). 


Write = ¢o(t) =», and for v > 0, 


t 
then 1(t) =t"dm(t). From 


Jma/(m +1) — = f 


—(f (f 


be 


we 


ists 
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we infer that 
t t 
+1) — (m + 1) ( 


so that ] ms, is a linear combination of the functions 


t t t 
t™ ( f dt) dm, | dt)*dm, ( dt)™* dn. 
0 e7 0 


Consequently, the function [4(t) ]m.. is a linear combination of the functions 
or of the functions 
th, (t), the(t),° > thmss(t). 
On differentiation, we find that 


(d/dt) [6(t) Col (t) + cyh,(t) + + 


where Co, Cms1 denote constants. Since 1(t), and by Lemma 6, hy(t) 
are integrable in the Lebesgue sense on (0,7), it follows that [¢(¢) ]’m41 is 
absolutely integrable on (0,7). Hence [¢(t)]ms: is of bounded variation 
in (0,7). 
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THE LAGRANGE MULTIPLIER RULE FOR TWO DEPENDENT 
AND TWO INDEPENDENT VARIABLES.* 


By CHARLEs B. BARKER, JR. 


Introduction. The Lagrange Multiplier Rule in the Calculus of Varia- 
tions may be briefly described as follows. Suppose that 2,(,y) and 22(z, y) 
are functions which minimize an integral of the form 


ff Y; 215 225 2145 dydx 


G 


among all functions z,(7,y) and z(x,y) which assume certain prescribed 
boundary values on G* and satisfy a given differential equation, 


$( 2, Y; 21> 225 21a Zoey) 0. 


Then there exists a function A(z, y), which is continuous together with its 
first derivative on the closed region G and is such that it satisfies the equa- 


tions of variation, 


(0/02) (fere + Aberz) + + Abay) = fey + Adz, 
(0/0z) (feos + + (0/dy) (feo, + fies + Adz, 


in which the unwritten arguments of the partial derivatives of f and @ are 
Z,(z,y) and Z.(x,y) and their partial derivatives. 

The existence of the function A(z, ¥) is demonstrated in this paper for 
all Z,(z,y) and Z.(2,y) which form a ‘ quasi-normal surface,’ an explicit 
definition of which will be given later. The first result obtained in a study 
of the Lagrange problem for double integrals was that of Gross [4].' His 
result was re-established by Coral [3] under weakened conditions. A further 
result obtained by Coral in the same paper is closely related to the result of 
this paper. However, the region of integration considered by Coral is con- 
siderably more restricted than the one considered here, and Coral’s differential 
equation does not contain all of the partial derivatives of the functions. On 
the other hand, the condition of ‘ quasi-normality ’? imposed upon the surfaces 
considered in the present paper appears to be more restrictive than the 


* Received July 11, 1944. 
1’ Numbers in brackets refer to the bibliography. 
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‘normal’ condition of Coral’s. A very comprehensive treatment of the 
problem in one dimension, together with an extensive bibliography may be 
found in [2]. 


1. Notation. The following notation is employed. If @ is a region, 
then @ indicates its closure and G* its boundary. A function, f(z, y), is said 
to be of class C™ on G if the function, together with its partial derivatives 
of the first n orders, are continuous on G@ and coincide with functions which 
are continuous on G. A region G is said to be bounded by a simple, closed, 
regular curve of class C,™ if G* is rectifiable and if the parametric equations 
of G*, r—-2(s), y=y(s), s being the are length, are of class C™ with 
(s) and satisfying a uniform Holder condition with exponent a, 
0<a<1. The symbols p; and q; are used to represent, respectively, 


and (0/dy)[z:(2, y)], (i =1, 2). 


2. A preliminary transformation. In this section, a particular one-to- 
one transformation of the entire (2, y)-plane into the entire (é,7)-plane is 
established. This mapping possesses specific properties which are essential 
to the solution of the problem. In order to obtain the desired transformation, 


it is necessary to prove certain preliminary lemmas. 


LemMa 1. Let G be a region bounded by a simple, closed, regular curve 
of class Suppose that f(x,y) and g(x,y) are functions of class 
on G, and that 


(1) [f(z,y)]?+ lg(a,y¥)]? > 0 


at each point of G. Then there exist two functions, r(x, y) and 0(x, y) which 


Add 


are of class C’’” on G and which satisfy 


(2) r(x, y) cos 6(«,y) =f (2,9), 
(3) y) sin 6(a, y) = 9 (a, y) 
at each point of G. 


Proof. Let 


(4) r(a,y) = + {[f (x,y) ]? + Lo y) 
and let 
(5) 6(x, y) [4 (a, y)dx + B(a, y) dy] + 6(2o, yo) 
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where (2%, Yo) is a point of G, and where 


(6) A(a,y) B(x, y) P+g 


By Green’s Theorem, if FR is any rectangle contained in G, 


f (x, y) dx + B(a, y)dy] =0. 


Hence, by a well known theorem (see, for example, [6]), 6(z,y) is of class 


C’ on G, and is such that 
(7) 00/0x = A(z, y), 00/dy = B(x, y). 
Further, since 
A(z,y) = (0/dx) (arctan g/f), B(x,y) = (0/dy) (arc tan g/f), 
it follows that 
(8) cos f/+ {f+ 9°}4, sind=g/ {f? + 9°}4, 


where the signs in the denominators are to be taken the same. Thus, if 
6(Zo, Yo) in (5) is chosen so that the positive sign holds in the first place, 
that sign will always hold on G, since 6(z,y) is continuous there. In view 
of (1), (4), (6), (7), and (8), r(z,y) and 6(2z,y) are of class C’” on G, 
and they clearly satisfy (2) and (3). 

LemMMA 2. Let G be a region bounded by a simple, closed, regular curve 
of class C,’’, and let a(x,y), b(a,y), c(x,y), and d(x,y) be functions of 


class C’” on G, with 
(9) [a(z, y) ][d(z, y)] — [b(2, y) ]Le(z, y)] 40 


at each point of G. Then there exist four functions, r1(t,y), 0:(2, y), 


r2(x,y), and 0.(z,y) which are of class C’” on G, and are such that 
oY 


(10) 9) €080:(2, == sin y) 9) 5 
T2(Z, y) cos y) =b(z,y); y) sin y) y), 


on G. In addition, 6,(z,y) and 6,(2,y) satisfy the inequality 
(11) 0< mS | 6,(r,y) —O,(2,y)| SM <z, 


at each point of G. 


| 
| 
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Proof. Statement (9) implies that 
[a(z,y)]?+ [e(z,y)]*A0, and [b(2,y)]’ + [d(z,y) 40 


on G. Thus Lemma 1 applies, and hence there exist four functions 7,(2, y), 
6,(z, y), F(x, y), and 6,(2, y) which satisfy (10) on G and are of class 0” 
there. Now, 


ad — be = 7,7 sin (6, — 62), 
and so, as a result of (9), 
y) y) Amn 


for all integral values of m. Since both 6,(2, y) and 82 (2, y) are continuous, 
there exists an integer N such that 


Na < [6.(2, y) —0:(2,y)] < (N +1)z, 
or, 

0 < [{6:(2,y) — Nw} —6,(2,y)] <7. 
Accordingly, if 


ri (2, y) =7,(2, y), 6, (x,y) 
= + (2, 62 (2, y) = 6,(z, y) — Nz, 


where the plus or minus sign is taken according as N is even or odd, it is seen 
that equations (10) are satisfied and that 


(12) 0 < | 6.(2, y) —6:(2,y)| 


In view of the continuity of 6,(2,y) and @.(x, y), (12) is equivalent to (11), 
thus concluding the proof of the lemma. 


LemMMA 3. u,(X), Us(@), Usg(x), and us(x) be functions of class 
0”, 0’, and C, respectively. Then there exists a function U(x, u) which 
ts of class C’”’ for y= 0, and is such that 


U (2,0) = u,(z), Uy (2, 0) == ts(2), 
U,(2,0) =,’ (2), OU exe 0) =u," (2), 
Uy (z, 0) == ue(z), 0) = (2), 

0) == Oayy(z, 0) = us’ (2), 
(2, 0) = (2), Oyyy (2,0) = us(2). 


Proof. Let 
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J 
(@) ; v2(t,y) = (y/1!)[(1/2y) f u2(€) dé] ; 


vs(z,y) = (1/2y) X 
f tary [(1/2y) (us(o) 
n-Y 
If, now, U(x, y) be defined by 
U (x,y) y) + y) y) + 9), 
it may easily be verified that U(2x,y) satisfies the conditions of the lemma, 
Lemma 4, Let @ be a region bounded by a simple, closed, regular curve 
of class and let a(x, y), b(x,y), d(a,y), and f(a, y) 


be functions of class 0’ on G with 
(13) [a(z, y) y)] — [b(2,y) £0 


at each point of G. Then these functions may be extended to be of class C’” 
over the entire plane, with all functions and their first three derivatives uni- 
formly bounded and with (13) satisfied over the entire plane. 

Proof. By Lemma 2, there exist four functions, r;(r,y), 0:(z, 4), 
f2(@,y), and 6,(z,y) which satisfy (10) and (11). As a matter of con- 


venience, let 


(14) A(x, y) (x, y) — 6, (2, y). 
Now let 
(15) y=y(é7n) 


be the conformal transformation of the exterior of G onto the exterior of I, 
the unit circle in the (&, 7)-plane, the transformation being of class C’”” both 
ways, even on G* and T*. The existence of such a transformation is well 
known [5]. Now, let u(2z,y) designate, in turn, each of the functions 
11, 2, 0:, 02, 6, e, and f, and let w(é,7) be the transformed function on I™. 


Define, on I*, 
Ug = + UyYe, Un = + UyYn, Up(1, >) = ug cos + Uy sin ¢, 


(p,¢) being the polar coordinates in the (é,)-plane, with wz and u, the 
known values on G@*, and similar definitions being made for the higher 


derivatives of u(p,¢). 
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Now let h(p) be defined as a function of class C’” which is such that 
(17) h(2) =1. 


Let wu represent the functions 1;, r2, (2/r)6, (2/1) 62, and f in turn. 
Then for the closed annulus, 1 = p = 2, let w(p,) be defined by 


u(p,p) = U(p.¢)[1—-h(p)] + Ch(p), 
where the constant C is unity for r,, (2/1)0, (2/7) 62, e, and f, is zero for 6,, 
and is + 1 for r., the sign being that of r. on I, and where U(p,¢) is the 


function U(x. y) of Lemma 3, defined for 


=u(1, >), = Upp(1, 9), 
u(r”) = Up(1, ¢), = Uppp(1, 
Clearly u(p,¢) is of class C’’ for this region, and it may easily be verified 
that w(p,¢) matches up, together with its first three derivatives, with the 
known values on I*. 

From (4), (11), and (14), it is observed that 7,, 72, and @ possess bounds 


on the unit circle of the form 


lr [2k>0, 0< 2m/rS | (2/r)0| < 2. 


These bounds cannot necessarily be preserved on the annulus, but it is possible 


to maintain the following set of conditions there: 
(18) /2 > 0, | re | > 0,0 < m/e S | (2/r)0| S(M 4+ 2) < 2. 


Suppose that p, is the smallest value of p at which any violation of (18) 
occurs. Then let h(p) satisfy (16), be monotone, and satisfy 


h(p) =1 for (1 + pi)/2Sp—2. 


Clearly this preserves the bounds of (18). The other functions, together with 
all derivatives, are evidently bounded on the closed annulus, since each is 
continuous on a closed set. 

For p > 2, let u(p,@) and its first three derivatives be defined to possess 
their respective values at p= 2. As these values are all constants, u(p,¢) is 
clearly of class C’” for p > 2, and all functions, together with their derivatives, 


are bounded. In particular, 
(2/)0(2, = 11 (2,6) =| 72(2, 6)| = 1, 


so that (13) is seen to be satisfied for all p > 2. 
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Now let the inverse transformation of (15) be applied. The functions 
all return to their original values on G* and all derivatives match up on G*. 
Outside of G*, the functions are of class C’” and are uniformly bounded, as 
are all derivatives. In particular, (2/)6, 7,, and r2 are bounded by the trans- 
formed conditions (18), and thus (13) is satisfied everywhere. 

As an additional result of this lemma, due to the nature of the boundary 
of G, there exists a square having G* in its interior, and having the functions 
6, and 6, equal to 0 and 7/2, respectively, on the boundary and outside of 
the square. 


Lemma 5. Let the curve C, 
t=2(s), y¥=y(s), aSs=a-+k, 


s being the arc length, be a simple closed curve with no cusps, and be such 
that the interval (a,a+k) may be divided into a finite number of sub- 
intervals, (Sir,Si) im each of which closed intervals x(s) and y(s) are of 
class C’’. Let x(s) and y(s) be defined for all values of s by: 


a(s+k)—=2(s), y(s+k) =y(s). 


Then there exists a function, 60(s), uniquely determined except possibly for 
a multiple of 27, which is such that 


| 6({a + k]*) —0(a*)| = 2x, 
cos =2’(s), sinO(s) =y(s), 
cos 6(s;*) = 2'(s;*), sin 0(s;*) = y’(s;*), 
cos 6(s;-) = sin = y(si7), 

| 6(si*) —O(si-)| < 


Proof. This is a well known theorem, and may be found in [7]. 


Lemma 6. Let 6(x,y) be of class C”’ over the entire plane with | 6|, 


1@,|, and | 0,| all bounded by M for all (x,y). Let r=-2(s), y=y(s), F 


s being the arc length, be the solution of 


(19) dz/ds = cos 8, dy/ds = sin 6, 
for which 
=o, y(0) = Yo.” 


Then 


2The existence and uniqueness of these functions is guaranteed by well known & 
existence theorems in the theory of ordinary differential equations, and may be found 


in [1]. 
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(20) @(s:) =2(s2), (81) = 
implies that 
(21) $1 = 82. 


Proof. If one assumes (20) to be satisfied and (21) to be not satisfied, 
a contradiction can be readily obtained through the use of Lemma 5. 


LemMa %. Let 6:(x,y) and 6.(x,y) be of class C’ over the entire plane, 
be uniformly bounded together with their first derivatives there, and be such 
that 


0< mS | —6:(2,y)| SM 


for all (x,y). Then there exist unique functions of class C” 


t,(s), r,(t), and y2(t), 


such that 
(22A) dz,/ds = cos 6,, dy,/ds = sin 0, 
(22B) dx.,/dt = cos 62, dy,/dt = sin 62, 


and such that 
= 2, 9:(0) = 41, 72(0) = 44, y2(0) = 
For these functions, 


(81) = and yi (81) = y2(te) 
imply that 


Proof. This lemma follows as a consequence of the two preceding lemmas. 


LemMaA 8. Let 6,(2,y) and be of class over the entire plane 
with |0,|, | O21, | | | and | all bounded by M for all 
(z,y). Suppose also that 


0<ms 


6.(t,y) SU <r 


and y,(s,t) be defined for each s by 


known 
» found 2 


(0/dt) [x,(s, t) ] cos 62, (0/dt) [y.(s, t) ] —sin 0, 


with 


(s, 0) = 2,(S), y1(s, 0) = 9:(s). 


Then x,(s,t) and y;(s,t) are of class C’’ and the transformation 


r= y = 9:(s, 
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is one-to-one from the entire (s,t)-plane into a subset of the (x, y)-plane. 
Proof. This follows readily from Lemma 7%. 


LemMa 9. Let 6(x,y) be of class C’”’ over the entire plane with | 6|, 
| 02 |, and | all bounded by M for all (2, y). Let 


T= 2(s), y¥=y(s), 
s being the arc length, be the solutions of 
dz/ds = cos 6, dy/ds = sin 6, 
for which 
= 2p, y(0) = 
Then for each R there exists an s; > 0 and an s, > 0 such that 
(23) [a(s1) — 2%]? + Ly — Yo)? = 
(24) so.) — ao]? + [y(— 82) — yo]? = R?. 


Proof. Wet 2,(s,t) and y,(s,¢) be determined by - 


dx/0t = — sin 6, dy/dt = cos 6 
with 
(25) 2, (8,0) 2(s), yi(s,0) =y(s). 


Then by Lemma 8, the transformation 
«= ,(s,t), y = t) 


is one-to-one from the entire (s,/)-plane into a subset of the (2, y)-plane. § 
Let 
J,(s,t) = — LitYre, 
so that 
J;(s,0) =1. 

Also, since 

Jit = — (62 cos 6 + 6, sin 6) - Jy, 
it follows that 
(26) e?M\tl = J, (s, t) S 


In addition, 


+ yit?}# = 1. 


+T 8 
f f J,(s, t) dsdt. 
-T 0 


Integrating this and making use of (26), 


Now, let 


| 
f 


vé. 


plane. § 
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A(s) = (s/M) (1— e?7™™), 
and so 
(27) lim A(s) = + © 


In particular, if 7 —1, 
A(s) = (s/M)(1— 


Since (27) holds, there exists an s, such that the area in the (2, y)-plane 
corresponding to the set 
—1Si=1 


exceeds (R +-1)*. That is, there exists a point (80, ¢)) in this set such that 
[#1 (80, — 20]? + [ys (80, t.) — yo]? > (R 


In addition, 
[x (0, to) — a1 (80, 0) J? + [y1(s0, t.) — (So, 0) ]? 
so that 
[21 (80, 0) — + [y1(80,0) — yo]? > R?. 


Thus, in view of (25), it is evident that (23) holds. A similar method of 
attack will demonstrate the validity of the inequality (24). 


LemMA 10. Let G@ be a region bounded by a simple, closed regular curve 
of class Cy’, and let a(a,y), b(x,y), c(x,y), and d(z,y). be functions of 


on G, with 


[a(a, y) |[d(2, y) ] — [b(2, y) |Le(2, y)] 40 


class C 


at each point of G. Then there exists a one-to-one transformation 


(28) 


of the entire (x, y)-plane into the entire (é,)-plane, which, together with its 
inverse, is of class C’” over the entire plane, and in which é and n satisfy 


Exny Eyne 0, 
(29) ane Cy = 0, 
bé, + d& = 0. 


at each point of G. 


Proof. Let the extension of Lemma 4 be applied to these four given 
functions. Then the functions 7,(2,y), y), T2(x,y) and y), which 
are the functions of Lemma 2, are all of class C’”’ over the entire plane, and, 
together with their first three derivatives, are uniformly bounded everywhere. 


265 


266 CHARLES B. BARKER, JR. 


In particular, there exists a square S: [ (20, Yo), (Go + &, Yo + &)] which con- 
tains @ in its interior, and which is such that 6,(z,y) and @.(2,y) equal 0 
and 7/2, respectively, on S* and at all points exterior to S*, as well as at 
all points (z,¥) interior to S which are such that their distance from S* is’ 
not greater than unity. 
Let 
y= yi(s), and r—2,(t), y= 


be the families of curves satisfying (22A) and (22B), respectively, and 
designate by C, and C, the curves of the first and the second families, respec- 
tively, through (2, yo). Now let P: (x,y) be any point, and let C’, and C’, 
be the curves of the first and second families through P. By means of Lemma 
9, we can easily show that C’, intersects C2, and that C’, intersects C,. Let 
P, and P, be, respectively, these two points of intersection. Then let s be 
the directed distance along (C, from P to P,, and ¢ be the directed distance 
along C, from P to P,. Let é(z,y) and (xz, y) be defined by 


E(@,y) =s8, n(z,y) =t. 
By Lemma 8 and the above, the transformation 
(2, 


is one-to-one from the entire (z, y)-plane into the entire (é, 7)-plane. 
Let 27,(s,t) and y;(s,¢) be defined by 


(0/0s) [21 (s,t)] = cos (4/0) [yr(s, t) ] = sin 
with 
,(0,t) =22(t), yi (0, t) = y2(t), 
y=y2(t) being the equations of C,. Let and y2(s, t) 
be defined by 
[2.(s, t)] = cos y2), (8/08) [yo(s, t)] — sin 2( a2, 


with 
X2(s,0) = 2,(s), Y2(s,0) 


@=2,(S), y= Y:(s) being the equations of 
By Lemma 8, all of the functions are of class C’”’ in both s and ¢. Also, 
it follows that both of the transformations, 


r= y = yi(s,t), (11,2), 


carry the entire (s,¢)-plane into the entire (2, y)-plane. 


| 
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If these transformations are reversed so as to obtain 
(30) S=5S,(7,y), t=1t,(z,y), and s—s8,(2,y), 
from the first and second, respectively, then it is seen that 


Now, let 
Ji(s, t) = LisYit — Vityis, 1,2); 


it may easily be verified that J,(0,¢) is never zero, and that since 
= [(0/dy) sin 6, + cos - Ji, 


J,(s,t) is never zero. Similarly, J(s,¢) is never zero. Thus the inverse 
transformations (30) are of class C’’, and hence both é and 7 are of class C””. 
Now let 


J(s, t) Exny Eyne. 
This may easily be reduced to 
J = sin [6.(z, y) —6,(2, y) |/J iJ. 


Thus J is defined everywhere and never vanishes. Hence the inverse of (28) 
is of class C’”” everywhere. 

Now, ‘ 

Ane + Chy = 11 COS + 71; SiN 

== + = 0, 


since J, never vanishes and since 
Vo =— Yis/J1, == 
Similarly, it may be shown that 


This is the transformation sought for in this section. The essential 
features that will be needed later on are its differentiability properties, the 
three statements of (28), and the fact that it is one-to-one from the entire 
(z, y)-plane into the entire (é, y)-plane. 


3. A lemma of general interest in the calculus of variations. In this 
section, we seek a lemma which is of quite general interest in the Calculus of 
Variations. It is necessary to develop several preliminary lemmas which yield, 
upon proper application of the transformation of the preceding section, the 
desired result. 
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LeMMA 11. Let B(a,y), «(z,y), and e(z,y) be functions of class C” 
on the closed rectangle R: [ (21, y1), (2, y2) with B(a, y) and a(a,y) never 
zero on R. Then there exists a function u(x, y) which is of class C”’ in x and 
of class C’” in y which possesses the boundary values 

U(X, 41) = Yo) Uy(Z, Yr) = Uy Yo) = 0, 
and which is such that 


” B(x, y)[a(a, y)ue(x,y) + e(a, y)u(a, y) dy = 1. 


v1 


Proof. Let v(z,y) be of class C’ and be such that 
a(x, + ¢(2,y)v(2,y) =1/B(a, y). 
If now, ¢(y) is any function of class C” satisfying 


(41) = $(¥2) = $'(y1) = ¢'(y2) = 0 
and such that 


0, 
V1 


then obviously the required function of the lemma is 


u(a,y) =v(x,y)6(y)/ 


LemMMaA 12. Let (x,y) be a function of class C’ on R, R being the 
rectangle: [ (21,41), yz) |, and suppose that 


(31) 


for all u(x, y) which are of class C’’, with Urr2(x, y) also continuous and zero 
on R*, which vanish together with their first and second derivatives on R*, 


and which satisfy 
Ya 
(32) f + y) ]dy=0, 
"1 


B(a,y), a(x, y), and e(2,y) satisfying the conditions stated for them in the 
preceding lemma. 
Then there exists a function x(x) of class C’ fora, x2, such that 


(33) $(2, y) + (4/dx) [B(a, y) a(x, y)x(z) — y) € (zx, y) x(x) = 0 
on R. 


ver 
und 


the 


R*, 
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Proof. Let u,(x,y) be the function of Lemma 11. Now let v(z.y) be 
any function of class C’” which vanishes together with its derivatives on R*, 
and define by 


f° [a(a, + e(2, t)v(2, t) Jat. 
"1 
Thus y(z) is of class 0”, and satisfies 


= = = = 0. 
Now let 
U(x, y) == (x,y) + y)Y(2). 


Obviously u*(z, y) satisfies both (31) and (32). If, now, 


f° y)dydx 


be manipulated by various interchanges of letters and orders of integration, 
as well as by integration by parts, there results the equation, 


(4) 9) 9) — Bley) (@ 
+ (0/dx) (B(x, y)a(z, y)x(x) ]}dydz = 0. 
where 


x(t) = fs t)u, (a, t) dt. 
V1 


Since (34) is satisfied by all v(a,y) of class C’” which vanish together with 
their first and second derivatives on R*, then the statement (33) follows 
immediately from the fundamental lemma of the calculus of variations. 


LemMMA 13. Let B(x, y), C(z, y), D(a, y), (2, y), and F(a, y) 
be functions of class O” on a simply connected region T with AD— BC ~0 
on T. Then a necessary and sufficient condition that there exists a solution 
30 of class C’ on LT of the two equations 


(35) (@/dr) (Ap)+(0/dy) (Ch)= Ep, and (0/0xr) (Bp) +(0/dy) (Du) = Fp 


1s that 


[D(E—A,—C,)—C(F—B.—Dy,) 
id dy AD— BC 
AD— BC 
on 


Proof. If the equations (35) be solved simultaneously for wz and py, 
it is found that 


| 
the 
that 
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_ D(E—A,—C,) —C(F-—Bs—D,) | 


od AD — BC Bs 
AD—BC 


From these, we may conclude that if »s£0 on T, then p» is never zero on I, 
and so logy is defined and is of class C” on I. Dividing these last two 
equations by yp, it is clear that, if (35) holds, then (36) is also satisfied. 
Finally, if (36) holds, there exists a function ¢(2,y) of class C” on I' 
such that 
D(E — A, —C,) — C(F — B,-— Dy) 


= AD — BC 
— = Be — Dy) — — Ae — Cy) 
by = AD — BC 


since Tis simply connected. Thus any function p» of the form 
Ce?, 


C being any constant, satisfies (35). 


Lemma 14. Let a(x.y), b(2,y), c(a,y), d(a,y), e(x,y), f(a,y), | 


a(x,y), S(x,y), y), and f(x,y) be of class C” on the rectangle 
R: [ (21; y1), Ye) | with a-8A0 on R. Let u(x,y) and v(2,y) be of 
class C’ on R, vanish on R*, and satisfy 
(37) + dvy + fv =0 
on R. 

Then 


(38) f f + bv, + cu, + dry + eu + fv) 
R 
f ul (0/dx) (aH) —- eH — (ar + cy —- e) ]dydz, 
R 


where 
1 *v f(z. t) 
Va t 
x exp [— dr|- [ba (x,t) + d,(x, t) — f(x, t) 


Proof. If u(z,y) and v(a,y) satisfy (37) on R and vanish on F*, 
we have 


(39) v(a,y) = K(z,y) f B(x, t) 1) + e(z, t)u(za, t) ]dt, 


where 
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K (2, y) [— dr], 


ni 8(2, r) 
and 
1) 
B(z,y) = exp [ dr]. 


Now if the left hand member of (38) be integrated by parts, and v(z,y) be 
replaced by its equal from (39), there will result, after various interchanges 
of the order of integration as well as several integrations by parts, the right 
hand side of (38), as required. 


Lemma 15. Let a(x,y), b(2,y), c(z,y), d(z,y), e(x,y), f(x,y), 
a(x, y), (x,y), y), and y) be of class C” with 2-80 on the closed 
rectangle R:[ (2, y2) Suppose that 


f (ate + + cu, + dv, + eu + fv) dydx =0 
R 


for all pairs of functions u(x,y) and v(x, y) which are of class C” on R, 
which vanish with their first and second derivatives on R*, and which satisfy 


(40) au, + dv, +eu+ fv =0 


on R. 
Then there exists at least one function (x,y) of class C’ on R which 
satisfies the two equations 


(4/ar) (a — da) + (8/dy) (c) — de; 
(4/ox) (b) + (A/ay) =f —2E 
on R. 


Proof. Let u(z,y) and v(2,y) be of class C” on &, vanish with their 
first and second derivatives on R*, and satisfy (40). Suppose that wze2(z, y) 
is also continuous and vanishes on R*, Then the equation (39) of Lemma 14 
holds. Jn order for v(2, y) to vanish on R*, it must be true that 
(41) f “B(a, t) [a(a, 1) ue(a, t) + (2, t)u(2, t) dt =0. 

Thus, since all of the conditions of Lemma 14 are satisfied, the result stated 
as equation (38) must hold for all u(z, y) satisfying all the stated conditions, 
including (41). From Lemma 12, then, it follows that there exists a function 
x(x) of class C’, such that 


(0/0x) (aH) —- eH — (az + cy—e) + (0/0x) (aBy) — «By = 0, 


where H and B are defined as in Lemma 14. 
Now, if we define 
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A(&,y) =H (x,y) + B(z,y)x(2), 
it may easily be verified that this function satisfies the required conditions 
of the lemma. 
Lemma 16. Let G bea region bounded by a simple, closed, regular curve 
of class C,”". Let A(a,y), B(z,y), C(a,y), D(x,y), E(2,y), F(2,y), 
a(z,y), b(z,y), c(x,y), y), y), and f(a, y) be functions of class 


on G, with 


on G. Suppose also that the set of points (x,y) where 


ay AD — BC ] 
Ot AD — BC 


ts nowhere dense. Suppose finally that 
(43) f (aur + + cuy + dv, + eu + fv) dydx = 0 
R 
for all u(x, y) and v(2,y) which are of class C” on G, which vanish together 
with their first and second derivatives on G*, and which satisfy 
(44) Auz + Bu, + Cu, + Dv, + Fut Fu =0. 


Then there exists a unique function r(x, y), which is of class C’ on G, 
and which is such that 


(0/dx) (a—AA) + (0/dy) —AC) =e —AL, 


(0/dx) (b + (0/dy) (d—AD) =f —aF. 


Proof. Let the transformation of Lemma 10 be applied to these given 
functions. Then (43) becomes, I being the transform of G, 


(46) f f (ayug + + City + divn + e1u + dndé = 0 
where 
a, (é,4) = {a[x(é, n) + c[x(é,y), y(é, n) — 


with b,, ¢:, d;, e;, and f, represented by similar expressions, and where u(é, 7) 
and v(é, 7) are the transforms of u(z,y) and v(a,y). Equation (44) becomes 


(47) + Cy Ey) Ue + (Bike + Dy&,) ve (Aine + Ciny) Un 
+ (Binz + Diqy) + Liu + Fw = 0, 


[A(#, y) — [B(«, 9) 9) 140 
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A,, Bi, Ci, D1, H;, and F; being the transformed functions A, B, C, D, E, 
and F respectively. From the properties of the transformation, the coefficients 
of WU, and vg are zero, and, in addition, 


+ (Bine + Dyn) (ALD, — B,C;) (Exny — Eyne) 0, 
so that (47) may be written as 
aug + dvy + eu + Cv = 0, 
a, 6, «, and £ having the obvious interpretations, with 


7) 
Also u(é,7) and v(é,7) are of class C” and vanish with their first two deriva- 
tives on 
Now let 7 be a rectangle in F, and let & be the corresponding region in G. 
Then, by Lemma 15, there exists a function A,(é,7) which is of class C’ on p 
and which satisfies 


(0/0€) (a, — + (0/0) = — Ane, 


(48) (b:) (0/0) (dy — 18) = fr — 


Now let a, A, etc., be the original functions of x and y with z(é, ») and y(é, 7) 
substituted in when the context demands. Let 
J = Exmny — J, = LEYn — 


On rewriting the first of equations (48), using the formulas for a,, a, etc., 


we have 
0 + c&y ] [ane 4 ] 
J ev) An J e/J — 


Now by manipulation of this last expression, including the use of formulas 
for the derivatives of the inverse functions, as well as certain properties of the 


transformation, we obtain 


MA» Vy A(Az Cy) Ars Cry = — AE, 
where 
= ,/J3. 


This is the first of the equations of (45), and the second may be obtained 
analogously. 

Thus on each region R& of the type indicated, there exists at least one 
A(x, y) satisfying (45). Since the set where (42) holds is nowhere dense 


i 
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in G, it follows that (42) cannot hold identically on R, and it follows from 
Lemma 13 that A(a,y) is uniquely determined on R and hence over the 
interior of G. 

Finally, if A1(é, 7) =A(a, y)/J, it can easily be shown that A;, Aig, and 
Aim have continuous limits on T'* by studying the behavior of these functions 
along a small part of [*. 


4, The Lagrange multiplier rule. In this final section, we wish to 
prove the Lagrange Multiplier Rule for the case of two dependent variables 
and two independent variables. 

Suppose, then, that f(2, y, 21, Z2, Pi, Pos 2) and Y, 21, 225 Pry V2) 
are of class C’”’ in their arguments, that 2,(2,y) and 2Z.(2,y) are of class 
C’” on a region G, the boundary of which is a simple, closed, regular curve 
of class C,’”, and that 2,(z%,y) and 2.(z,y) minimize 


(49) f Ys 215 Pry Pos dydz 
G 


among all functions %, 2. of class 0” on G, coinciding with 2, and %,, 
respectively, on G*, and satisfying 
(50) (2, Y, 21, 22, Pi, Pes qi: 2) == 0). 

Now suppose for the moment that 7,(z,y;p) and 7Z2(z,y;m) are of 
class C” in their arguments for (z,y) on @ and | »| < po, coincide with 
Z, and 2, if and satisfy (50) for | mo. Then, if we differentiate 
with respect to » and then set » = 0, we see that 


(51) + + da liy + + = 0 


and ¢, and ¢ vanish on G@*. Here we have set 


i= [ (0/0p) (a, fo = [ (0/0n) 72 (x, 


and ¢p,, etc., stand for op, (2, Ys Z15 225 Ziys Z2y), ete. Equation (51) is 
known as the equation of variation. 

We cannot prove the multiplier rule for all possible cases, but shall prove 
it only for the case where the surface z, =: 2,(%, y), 22 = 722(%,y) is ‘ quasi- 


normal’ with respect to the differential equation (50). 
Definition. The surface 2, = 7,(2,y), = y) is said to be quasi- 
normal with respect to (50) if 


(i) AD — BC 0 on G, (A B= C= D= Par» = 
F = ¢$2,) ; 


I 


22, 


late 
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(ii) The set where 


[ =! 4,— ] 
dy AD— BC 
Oa AD-— BC 


is nowhere dense on G; and 


(iii) If ¢, and & are any functions of class 0” on @ which satisfy (51) on 
G and vanish on G*, there exists a one-parameter family Z,(z, y;), Z2(x. y3 p) 
of pairs of functions of class C” in their arguments for (z,y) a point of G 
and |»! < po such that 


(a) 2,(2,y;0) = 4 (2,y), Z2(r,y;0) = 
(b) [dZ, Op | ==), [62 | = Lo; 


(c) for each » with | wi < po. and 43) coincide on 
G* with 2,(z.y) and z.(z,y), respectively; and 


(d) 4,(2,y;) and Z,(2,y;) satisfy (50) for each » with | »| < po. 


THEOREM 1. Let f, 6, G, 2, and 2, satisfy the differentiability hypotheses 
of the second paragraph of this section and suppose that the surface z; = 2, (2, y), 
% = 2.(2,y) is a quasi-normal surface with respect to the differential equation 
(50). Suppose that 2,(x2,y) and Z2(2,y) minimize the integral (49) among 


ws 


all pairs 2,(2,Y), 22(x@,y) of class C” on G, coinciding with Zz, and Z, on G*, 


and satisfying (50). 


Then there exists a unique function, A(x, y) of class C’ on G which 
satisfies the equations 


(0/0) (Fr, — AGp,) + (0/0y) (Far — Aba) = — 
(a 0x) (Fie + (9/dy) (fas Azz 


on G. (The notation of equation (51) is employed here.) 


Proof. Let ,(v,y) and ¢.(2,y) satisfy the conditions (ili) of the 
definition. Since the surface z, = 2,(2, y), 22 = 22(%, y) is quasi-normal with 
respect to (50), there exists a one-parameter family of pairs of functions 
Z,(x,y3p), Z2(x,y3;p) which satisfy the conditions (iii) of the definition. 
Let J(u) be the function of » obtained by substitution of the Z;(z,y;m) in 
the integral (49). Since 2, and Z, minimize this integral, we must have 


I’(0) = f f Haken + fetes + fetes + forks fete) dyda 
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That is to say. we have 


+ + Cry + dLoy + ef: + f2)dydz = 0 


where a = etc., for every pair of class C” on G, vanishing on 
and satisfying 


Abie + Blow + Chy + Dboy + Eg, + Fé. = 0 


where A = ¢,,, etc. The existence and uniqueness of A(z,y) follows from 
Lemma 16 of 8, since it is clear that a, A, etc., are all of class C””. 
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THE NON-LOCAL EXISTENCE PROBLEM OF ORDINARY 
DIFFERENTIAL EQUATIONS.* 


By AUREL WINTNER. 


Introduction. Let f;,---,fn be real-valued, continuous: functions of 
position on the (n + 1)-dimensional region 


(1) —astSa, w; (t—1,---,n). 
Then, corresponding to any set of real values c;, the system of differential 
equations 

(2) fi (05 (4—1,---,n), 


where 2’ = dx/dt, has near t =0 a solution xj = (t) satisfying 
(3) (0) = Cj. 


The italicized proviso is essential, even if the system is “ conservative ” (in 
the sense that (1) and (2) become 


(4) w; (t—1,---,n) 
and 
(5) x’; = fi (2, Lp) 


respectively) and even if the functions f; are regular-analytic on the whole 
r-space. 
For instance, if (5) is the single differential equation 


(6) == 


then, although f, =f, is a polynomial, every solution x(t) determined by a 
non-vanishing x(0) ceases to exist at a finite {= f). In fact, all solutions 
distinct from the solution x(t) ==0 (which belongs to x(0) —0) are repre- 
sented by x(t) (f, —?t)~*, where is a non-vanishing integration constant 
determined by the initial value 7(0). But the function (t) — ¢)-? cannot be 
continued beyond ¢ =f, so as to remain a solution. For, if it could, then, 


* Received March 28, 1945. 
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since it should satisfy the differential equation, it ought to be defined at t = t, 
in such a way as to be differentiable, hence continuous, at {== ¢). But this 
is impossible, since the solution is not bounded near ¢t = fo. 

These trivial comments are made only in view of (iii bis) below and in 
order to make it clear that the possibility of “moving singularities” in the 
particular case of analytic differential equations is just a manifestation of the 
local nature of the existence theorem in the general case. In this regard, not 
even Riccati’s type, though reducible to a linear differential equation, is ex- 
ceptional, since (6) is of his type. 

On the other hand, the existence theorem of the initial problem (3) is 
not of a local nature if the system (2) is linear and, without additional loss 


of generality, homogeneous, that is, if 
(7) +, tn) =far(t)ar + fin(t) an, 


where, by virtue of the assumption made with regard to the range (1), the 
n? functions f;,(¢) are continuous on the interval —a ta. In fact, the 
solution of (2) determined by arbitrary initial data (3) is known to exist 
on the whole interval —a=t=a in the case (7). As was shown in [1], 
pp. 553-556, the non-local existence theorem for the linear case can be 
obtained as a corollary of the local existence theorem for the non-linear case, 
if the limited t-range supplied bv the latter is applied repeatedly. The possi- 
bility of this reduction depended on the divergence of the harmonic series, 
=1/m = ©, a series resulting from the particular structure of (7). 

Loc. cit., this elimination of the non-local existence theorem of linear 
systems as an independent theorem was just a matter of convenience, namely, 
a way of replacing two proofs by one. It took me twenty years to observe 
that the reduction process applied in [1] has more to it than this, just 
methodical, interest. In fact, it turns out that the whole of this reduction 
process can be worded so as to avoid the assumption of linearity. And it 
happens that the resulting rewording leads to a solution of the non-local 
existence problem (“in the large,” i.e., in the “ unrestricted” sense of [3], 
pp. 85-86, 132, 341) for the non-linear equations (2) ; a solution which proves 
to be of a final nature. Cf. (iii*) and (iii bis) below. 

Both the result and its proof are very similar to the procedure applied 
by me to Strémgren’s empirical principle of natural termination. On the 
other hand, an application of Birkhoff’s subsequent variant of this proof oi 
the termination principle would not lead to a solution of the problem (that 
is, to (iii*), (iii), or, for that matter, to the weaker results (i*), (i), which 
are not final in nature), since Birkhoff’s variant consists in the replacement 


Fis 
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of the constructive process of prolongation by the indirect device of the Heine- 
Borel theorem. For references cf. [3], pp. 441-442. 

It should finally be mentioned that the proofs, which will be worded 
for the case of a real variable ¢, need no modification if ¢ is replaced by 
z—=2-+ 14; so that all results remain valid in the case of (ordinary, non- 
linear) analytic differential equations in the complex domain. 


1. The arrangement of the proofs becomes particularly convenient if 
the most general case, instead of being formulated directly, is reached through 
successive stages of straightforward generalizations. In fact, all the essential 
elements of the proof are needed for the following particular case: 


(i) If fi,°*+,fn are continuous functions of position on the space (4), 
and tf (C1, + + ,¢n) is any point of this space, then (5) and (3) have a solution 
2;(t) which exists on the whole t-aris, <t< w, whenever the 


given functions f; are subject to the n inequalities 
(8) fi(t1,° +, =O(|z|) as |r|. 


Here, and in the sequel, |x| denotes, not the Euclidean length of the 
vector x, but the length of the longest component of 2, that is, 


(9) | | = max (| 2,|,---,|an|), where -, 

(this leads to |x| —|2| if n=1). Accordingly, (8) means the existence 
of a constant A satisfying 

(10) | < Amax (| |,- -, | |, 2). 


The (n + 1)-th term, 1, on the right of (10) is adjoined in order to take care 
of the vicinity of the origin of the x-space. 

Since only the continuity of the functions f; is assumed in (i), the 
solutions to which the assertion of (i) refers will not in general be unique. 
In addition, the solution can be unique near ¢ =0 but then lead to branch- 
points. What will happen in such a case is clear from the wording of (i) and 
from the following proof. Needless to say, the emphasis in (i) is not on this 
generality, since what really is interesting in (i) is the case of “smooth” 
functions f; of the type occurring in dynamical applications (or, for that 
matter, the case of regular-analytic functions); cf. [8], p. 356. And the 
solutions must be unique throughout (and so, by (i), for —o <t< oo), 
if the given functions f; satisfy, for instance, the local condition of Lipschitz 
(e.g., if every f; is of class 0’). 
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If the n continuous functions f;(2,,- - -,2,) have an absolute value not 
exceeding M in the cube | —2io | Sb about a point *,2no), and 
if ¢) is arbitrary, then (5) has a solution 2;(¢) which satisfies the initial con- 
dition x; (to) = io and exists, at least, on the ¢t-interval consisting of those 
values which differ from t) by not more than b/M, and the n inequalities 
| z(t) — rio | = 6b hold for every ¢ contained in this interval. The lower 
bound 6/M for the ¢t-range of assured existence follows by the method of 
equicontinuous functions. The same 6/J/ results when, under an additional 
restriction of Lipschitz’s type, another method, such as the method of suc- 
cessive approximations, is applied. Actually, b/M cannot be improved in the 
complez-analytic case, a case in which b/M becomes a “ best universal con- 
stant” (of the type of absolute constants occurring in the theory of con- 
formal mapping) ; cf. [2]. 

If the constants 2jo,t),b are identified with ci,0,1 respectively, it is 
clear from the definition of M and from (10), that Mf can be chosen to be 
A max (c,1) = Ac, if c is any value exceeding each of the n + 1 numbers 
|c;|,1. Hence, if ¢ is fixed in this manner, then b/M —1/M becomes the 


reciprocal value of 4c. Consequently, if 
(11) t, == A-'/¢, 


then (2) and (3) have a solution z;(¢) on the interval 0O=¢= 4,, and the 
n inequalities | x;(¢) —c; | 1 hold for every ¢ contained in this interval 
and so, in particular, for ¢=1t,. It follows, therefore, from the definition 
of c that 


(12) | <<e+1. 


Next, let io, to, b be identified with 2;(t,), ¢:,1 respectively. Then it is 
clear from the definition of Jf (for the resulting cube) and from (10) and 
(12), that A max (ec + 1,1) = A(c-+ 1) is an admissible value of M. Hence, 
b/M =1/M becomes if denotes the quotient 


(13) A*/{o 44), 


Consequently, the solution 2;(t), the existence of which was assured for 
0 =t= tk, proves to be prolongable for 4; S¢St, + te. Since Lio = a4 
it also follows that the n inequalities | 2;(4) —ai(t,)| ST are satisfied for 
every ¢ contained in the latter interval and so, in particular, for ¢ = t, + 


Accordingly, from (12), 


(14) | + 


fe 
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If 2io, are identified with 2; (t, + t:-+¢., 1 respectively, it 
becomes clear that the possibility of repeating this procedure is never arrested. 
The m-th term of the sequence which starts with (11) and (13) is seen to be 


(15) tm = A*/(c + m—1), 
and the m-th step extends the solution initiated by (3) to the ¢-interval 
(16) StSii+-: tm, (t = 0), 


since, corresponding to (12) and (14), 
(17) +tna)| <e+m—1. 

The first m intervals (16) cover the range 0 = t= Tm, where 
(18) 


But, since 31/m == ~, it is clear from (15) and (18) that Tm—o as 
m—>o. Hence, the existence domain of the solution 2;(t) contains the 
whole half-axis 0 = Since ¢ can be replaced by the proof of 
(i) is now complete. 


2. If the functions f; are polynomials, then the assumption (8) of (i) 
is satisfied only if the system (5) is linear. On the other hand, (8) includes 
an extensive class of algebraic, and for that matter rational, functions fi, 
as well as entire functions f; (as exemplified by such types as 


f(z) (142°)¥, cexpsing, 


where n=1). In the linear case, the general solution of (5) consists of 
exponentials e“* and (possibly) of - since ¢ does not occur in 
(5). In particular, every solution is O(e°t), as {> oo, in this trivial case. 
In the non-linear case, no “ explicit” solution of (5) is possible, of course 
(except when n 1). But it turns out that the growth of the solutions in 
the non-linear case is limited by the same estimate as in the trivial case of a 
linear system with constant coefficients : 


(ii) Under the assumptions of (i), each of the n components 2;(t) of 
every solution of (5) is O(e°t) as t—> «, where C ts a constant. In addition, 
C can be chosen so as to be independent of the integration constants (3). 


In fact, it is clear from the proof of (17) that | 2i(t)| << ¢-+ m holds 


for every positive ¢ not exceeding the value (18). On the other hand, it is 
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seen from (15) that, as m—> o, the value (18) is asymptotically equal to 
A*logm. This proves the first part of (ii). The refinement claimed by the 
second part follows by observing that the constant A, being defined by the 
formulation (10) of (8), depends only on the system (5). 

The trivial case mentioned before (ii) shows that the estimate supplied 
by (ii) cannot be improved if only (8) is assumed. However, it is clear from 
the proof of (ii) that C in (ii) can be chosen arbitrarily small if (10) holds 
for every A. In other words, (ii) can be supplemented as follows: 


(iibis) If O(|a]|) tn the assumption (8) of (i) is refined to o(| 2]), 
then O(e°') in the assertion of (ii) can be improved to exp o(t). 


By an adaptation of these remarks, estimates of the growth which corre- 
spond to (ii), (ii bis) can readily be obtained for each of the cases covered 
by the general result (iii) below. 


3. In (i), the differential equations are of the particular type (5), that 
is, ¢ does not occur explicitly. In order to extend (i) to the general case (2), 
let f:,: - -, fn be given as continuous functions of position on the (n + 1)- 
dimensional region (1). The ¢-bound b/M supplied by the local existence 
theorem, used in the above proof of (i), must then be replaced by min(a, b/M), 
where a is the number occurring in (1). Correspondingly, the values (11), 
(13), (15) must now be restricted so as to be within the ¢-range of (1), 
that is, (15) must be replaced by 


(19) tm = min(a, A~?/(c + m—1)). 


Then the extension of the solution to the consecutive intervals (16) is exactly 
the same as in the proof of (i), if it is assumed that the functions f; (which 
now contain ¢) satisfy the inequalities (10), where A is a constant (in- 
dependent of and of t). 

Thus, under the assumption just mentioned, the existence of the solution 
2;(t) follows on the interval 0=¢= 7m, where m is arbitrary and 7'm is 
defined by (18) and (19). However, since 31/m = oo, it is clear from (18) 
and (19) that Tm becomes identical with a when m is large enough. Con- 
sequently, (i) can be extended from (5) to (2), as follows: 


(i*) If are continuous functions of the position (t;21,° tn) 
on the domain (1), and if c,,- + +,¢n are arbitrary, then (2) and (3) have 
a solution 4,—=2;(t) on the whole t-interval, —aStSa, whenever the 
given functions are subject to the n inequalities 


= 


the 
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(20) max | fi(t;%1,° @n)|=O(| |) 


as (9) tends to o. 


Clearly, (i*) implies the classical result as to the unrestricted existence 
of the solutions of linear differential equations. In fact, (20) is satisfied in 
the case (7), where the fix(t) are arbitrary continuous functions on the 
interval —a < ta. Conversely, since the estimate (20) is just what is 
required in the case of linear systems, the exceptional function-theoretical 
standing of the linear systems might suggest that (20) cannot be relaxed. 
However, it turns out that (20) is not the true condition, since (i*) can be 
improved as follows: 


(iii*) The assertion of (i*) remains valid if O(| x |) in (20) is relaxed 


to O(L(| x!)), where L(r) is any L-function satisfying 


co 


(21) f 


This contains (i*), since fdr/L(r) =logr in the case, L(r) =r, of 
(20). And (iii*) is the true theorem, since its integral condition cannot be 
altered, as will be shown below. 


4. The proof of (i), where (2) is of the form (5), is based on the fact 
that the series }1/m diverges. Correspondingly, the proof of (i) remains 
unaltered in cases in which 31/(m log m) or 31/(m log m log log m) takes 
the place of $1/m. On the other hand, what introduced the particular series 
31/m in the proof of (i) was just the linearity of the function, (9), occurring 
in the O-assumption, (8), of (i). Thus it is clear that (i) can be generalized 
as follows: 


(iii) The assertion of (i) remains valid if O(|2|) on the right of (8) 
is relared to O(L(|«|)), where L(r) is any L-function satisfying (21). 


Actually, the assumption that Z(r) be an L-function is not needed to its 
full extent. On the other hand, the assumption (21) is fundamental indeed: 


(iii bis) Corresponding to any L-function subject to 


(21 his) ar/L(r) < 0, 
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there exist systems (5), even analytic systems, which satisfy the assumptions 
of (ili) except the assumption (21), but are such as to make the assertion of 
(iii) false (1. e., such as to possess solutions x(t) which cease to exist at a 
finite t = 


In order to see this, let n —1, and let f,(z,) =f(x) be identical with 
I(x) (for large positive 7,2), where Z(r) is any given L-function satis- 
fying (21 bis). Thus the system (5) becomes the single equation 2’ = L(z) 
(for large x >0). Its solution r—-z2(t) results by the inversion of the 
relation 


(22) t = f dr/L(r) + const. 


(if c is large enough). Since ZL >0, the integral (22) is an increasing 
function of z (> c), and-so the correspondence (22) between ¢ and 2 is 
one-to-one. But it is clear from (22) and (21 bis) that ¢ tends to a finite 
limit as Hence, if ¢° denotes this limit, then z(t) © as 
where ¢t < ft) (<1 ©). It follows therefore from the remarks made after (6), 
that the solution z(¢) cannot be prolonged beyond 2°. 

This proves (iii bis). And (iii bis) implies the assertion made after 
(iii*). Finally, in order to prove (iii*), it suffices to make in the proof of 
(iii) the trivial modification which led from the proof of (i) to that of (i*). 
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CRITERIA FOR THE ABSOLUTE CONVERGENCE OF A FOURIER 
7 SERIES AT A GIVEN POINT.* 


By KigEN-Kwone CHEN. 


) 1. In the case of the convergence of the Fourier series of f at a given | 
4 
" point, the problem is entirely controlled by the behavior of f in the neighbor- 
hood of that point. But the absolute convergence of the series is not a local 
property, and in fact depends on the behavior of the function throughout the 
whole interval of definition. 
Various sufficient conditions for the absolute convergence of the series 
ig 
is (1) $4) + S (an cos nx + by sin nz) 
n=1 
te 
ys" are known.? Any one of these implies the convergence of the series 
F 
(2) (!an| + | dn |). 
er The necessary and sufficient condition for the absolute convergence of a 
of trigonometrical series in the whole interval, is that the series is a Fourier series 
). of a Young’s continuous function.’ 
2. Let h(x) be an even function such that h(x) =2* (0<a<1) for 
and that h(x + 27) —h(x) for every zx. Write 
g(x) = (sin nz) /n, h(z) + 9(x) =f (2), 
n=1 
and suppose that the Fourier series of f(z) is (1). Then we have b, —1/n. 
PI 
he An easy calculation shows that a, == O(n-'*), Thus, the Fourier series of f 
of converges absolutely at the point 
57 i 
(3) 0, 


without the convergence of the series (2). It is perhaps worth remarking 
that the example may be obtained more quickly by setting a, = 0, bn = 1/n. 


* Received February 21, 1944. 
*Cf. A. Zygmund (1), Chapter VI. 
*K. K. Chen (1); G. H. Hardy and J. E. Littlewood (1). 
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The absolute convergence at the origin (3) of the Fourier series can however 
be known by the following criterion.® 


THEOREM 1. Jf 0< p< 1, and the derivative of 


(u) du 
(4) (t{—u)? 
evists asa function of bounded variation i nthe interval 0S t= 7x, where 
(5) $(t) = t) + f(z—t)} 


then the Fourier series of f converges absolutely at the point x. 

In fact, when ¢(t) = $/*, the derivative of the function (4) becomes a 
constant multiple of ¢? which is of bounded variation in (0,7), since a is 
non-negative. 

3. In order to prove Theorem 1, we establish several lemmas. 


Suppose that 0< a<1, x=0, and that the integral of h from 0 to g, 
of order 1— a 


(h(2))s-a— (1/E(1—a)) 


has the derivative (d/dz) (h(x) ):-a at the point z, then we write 
(h(x))-a— (d/dz)(h(2))s-0 


Lemma 1. If the derivative of (h(x))1ra (0<.a<1) eatsts as a 
bounded function in (0,X), then (h(x))q ts equivalent to a function of 
Lip « and 

h(x) 


except for a null set of points z. 
Proof. Write 
 max|g(r)|—A. 
If0S2<2=Y, then we have 
(2) )1-a — SA(2’—2). 


Hence (h(x) ):-a is an absolutely continuous function. 


* Another criterion for the absolute convergence of a Fourier series at a fixed point f 


can be found in the note: K. K. Chen (2). 


at 


i 
| 
i 
i 


point 


THE ABSOLUTE CONVERGENCE OF A FOURIER SERIES. 287 


Since g(t) is bounded, (g(x))a is continuous.* Remembering that 
0<a< 1, we have 


1-0 = “g(t)dt = ((g(2) )a) 


It follows that the two functions h(a) and (g(2))a are equivalent. Hence 
h(x) belongs to the Lebesgue class L°(0,X) for every p>1. Hardy and 
Littlewood have proved that® if p=1,0<a< 1, and h(z) belongs to L?, 
then 


asd—>-+0, It can be deduced from this theorem that 


(6) (h(x) Lip «. 


But it is better to give a direct proof for (6). We have 
(7) (A(z) a} 
0 0 
We may suppose that A(t) is continuous, so that 
4X), 


if M is sufficiently large. Assuming 2’ > 2, it follows from (7) that 
(h(z) SaM f° (a + M(x —2)* 


= M(a’* — 2%) + 2M (2’ — 3M(2/ 


sincee0<a<1. This proves (6). 
Finally, by definition, (h(z)).—g(z). Hence 


((h(®) )-a)a = (9(2) )a. 
Since ((g(x))a is equivalent to h(x), the proof is completed. 


4. Lemma 2. Suppose wz, and that 


Z(w) = fou — u)*"t-4 cos nt dt du. 


‘*G. H. Hardy (1). 
°G. H. Hardy and J. E. Littlewood (2). 
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Then there is a constant C such that 


SC(nw)** (nw <1, 
= (nw <1, 


C(nw)-+ (nw = 1), 


Proof. In the first place, it follows from 


t 
Z(n)— tcosnt ut*(¢t— u)**du dt 


= B(a,g—a+1) f, nt dt 


= (0) 
that 


Z(w) =Z(r) — — u)**t4 cos nt dt du 


=— cos nt — u)**du dt 
w 


(9) —— {cos nt dt 
= (1/n) nt - (w/t?) (w/t)t*(1 — w/t)? dt 


= J (¢ — sin nt dt. 
The second mean value theorem gives two numbers ¢, and ¢, such that 
ty 
(w) = (t — sin nt dt, 
w 


and that w < t, < t,, 
tg 
| (w) | + | sin nt dt |. 


Hence 
| nw*Z (w)| S (2 + 1/a)n*. 


This proves (8) for the case nw = 1. 
Secondly, let nw = 1; we have 


wires (¢ — w)**t-1-4 sin nt dt 


n  _sin(y-+ nw) 
J (y + nw)**4 


*n(1—w) 
0 
1 


ah 


co 
= (nw)*9*{An (y + nw)-tdy + pn 
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where | An | <1, | yn | <1. Since g +1 > a, the integral over (4/2 —1, «) 
is convergent. The integral 


is O(1), when a>q. And if «Sq, then 

I< (nw)-4- (1/a) (x/2)*. 
This completes the proof of the lemma. 


5. Lemma 3. If P(t) is of bounded variation in (0,7) then the series 


are both convergent, where «> 0. 
Proof. The function 
H(t) = min {(nt)*, (nt)-*} 


is bounded in (0,7). In fact, 


nt>1 


This is clearly O(1). The sum of any one of the two series in the lemma, 
is not greater than ( H(t) | dé(t) | which is a finite number, since $(t) 


is of bounded variation. Lemma 3 is thus proved. 


6. Guided hy these lemmas, we were led to the following theorem which 
is an extension of Theorem 1. 


THEOREM 2. [f 0<a<1,q=4, and the function 


d u%p(u)du 


(10) y(t) uJ, 


exists and is of bounded variation in (0,2), then the Fourier series of f con- 
verges absolutely at the point z. 


This reduces to Theorem 1, when a = q = p. 


Proof. We have to deduce the absolute convergence of the series 


(11) (2/n) XO cos ntdt 


n=1 
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from the fact that the function y(¢) is of bounded variation in (0, 7).°® 
In virtue of g =a, the function 


1 d c¢tu%p(u)du 
(t))-a—= dt (t—u)@ 


being the product of y(t) and t**/I'(1 — a), is of bounded variation in (0,7). 
It follows from Lemma 1 that the relation 


p(t) 


holds almost everywhere. Therefore the n-th term of (11) can be written as 
f, t-4 cos nt - ) (t-—u)*1(u%p(u) )-adudt 
0 0 
= (1/T(a)) f, (wou) )- cos ntdtdu 
0 


= (1/0(2)) (u)) (u). 
0 
On integration by parts, we obtain 


T'(a)P(1— a) #8) cos ntdt — Z(w)ay(w) 


since Z(0) =Z(r) =0. Hence, by Lemma 2, 
*1/n 
(12) | { $(t) cos ntdt | = (C/n) f (nw)*-* | dy(w) | 
+ (C/n) | dy(w)|. 
1/n 


The absolute convergence of (11) follows from (12) and Lemma 3. The 


theorem is thus proved. 


7. If we equate a and g in Theorem 2, we obtain Theorem 4. In particular, 
if the derivative ¢’(t) exists and both ¢(t) and t¢’(t) are of bounded variation 
in (0,7), then 

d d (tv) dv 


1 yP(h(tv) + tud’(tv) )dv 
(1—+)? 


*°Cf. L. S. Bosanquet (1). 


The 


ilar, 
tion 
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which is evidently also of bounded variation in (0,7). Hence we obtain the 
following 


THEOREM 3. If 6(t) and td’(t) are both of bounded variation in (0, 7) 
then the Fourier series of f converges absolutely at the point x. 


co 
8. Write o,* for the n-th Cesaro mean of order « of the series > dn. 


n=0 
Thus, writing (a), = for >—1, 


T(n+1)r(é+1) 


ont = (1/(2)n) (a) 


v=0 


The series Xa, is said to be absolutely summable (C, a) or summable | C, « |, 
if the series 


co 
| — On-1" | 
n=0 
with o-,* = 0, converges. In this case, letting on*—>s, we write 
20 
an = | % |, 
n=0 
In particular, summability | C,0| is the same as absolute convergence. 
Lemma 4. If a >—1 and Sa,—s|C,«|, then for e>0, 
=s|Crate|. 


This important theorem is due to Kogbetliantz.*. It seems however that 
the proposition is new in the case —1<a< 0. We give a proof here. 


Proof. Without loss of generality, we can assume that e< 1. Writing 
z+ «= , we have, by Abel’s transformation, 
n 
(B) non? = (€ — 1)n-v(@) vov* 
v=0 


n=1 v 


= (01% — mt) 1) nu + 
v= 
It follows that 


onP — —=((%)n/(B)n) (on® — on-1*) +S — (n, v), 


*E. Kogbetliantz (1), (2). 


). 
\ 
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where 


If N > 2, then we have 


N 

| on? — on- | = | On | | ( (ov*— oy41%) (n, v) 

The first term is O(1), as N-> o, since (B)n > (a)n. The last term is 
equal to 


— | |(n, v) |. 


n=Vv+2 


N 
Hience it is enough to show that & |(n,v)| is uniformly bounded in N and ». 
v+2 


There is a constant K such that 


N v 


n=2y+1 n=2y+1 
This is equal to 


K(a+1)) n2*=0(1), 


n=2y+1 


uniformly in NW and ». 
If v+ 2S then (B8)n-1 O(v*) and 


| =— (a) )u(e-- 2 2) nop 


n=v+2 n=v+2 p=0 


n=V+ 


1) — (e— ] Joy 


v 
This is less than Hence 
u=0 


(1/(B) n-1) = (%) n(e— 2) np = O(v'8) (B)v = O(1). 


n=V+2 


Therefore we have 


- 
| N 
| 


v. 
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|(n,v): | = 0(1). 


In a similar manner, we can prove 


| 


n=p+1 
The proposition is thus established. 
9. It has now been shewn that the absolute summability of negative order 


is a property which is stronger than absolute convergence. Let us next con- 
sider the absolute summability of negative order. For this purpose, we write 


[x(t) Ja + a) )a 
as the mean function of order « of x(t). Thus 


[x(¢) Jo = 
[x(t) ]a = (a@/t*) du (2>0,¢>0), 


provided that y(¢) is integrable in the Lebesgue sense over (0,¢)). The 
latter equation in the case 0 < a < 1 holds good for almost all values of ¢ in 
(0, ¢)).° The mean functions of negative order exist with additional hypotheses. 


THEOREM 4. Suppose that and that at the point 
x the function 


(13) t-a[t1p(t) Ja 


is of bounded variation in (0,7); then, for « > 0, the Fourier series of f is 
summable | C,—a-+el|. 


Before giving the proof of this theorem it is convenient to prove the fol- 
lowing three lemmas. 


10. Lema 5. Write g*(n,t) for the n-th Cesaro mean of order « of 
If —1<B<0,° then 


| g8(n, t)| S Ant(1 + nt)-?4, (¢> 0). 


®°G. H. Hardy (1). 
a=0, then ga(n,t) =-O(1+ nt)-2+ 0(1+ nt). See Obreschkoff (1). 
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Proof. In fact, we have 


g®(n, t) > (B 1) n-v sin vt. 


Abel’s transformation gives 
n-1 
(B—1)n-vsin vt = > (8) vAsin (n-—v)t 
v=0 v=0 
n-1 
= — 2sin(t/2) (B)v cos(n —v— $)t 
v=0 


which is numerically less than a constant multiple of tn*-®. This proves the 
lemma for the case nt = 1. To complete the proof, we rewrite 


(B—1).vsinvt = ( + )(B—1)vsin(n— 
y=0 nt>1 
Now, if nt > 1, 


| (@—1)ysin(n—v)t |= S At, 
y>1/t 


vt>1 
(8—1)vsin(n—v)t =— 2sin(t/2) (B)vcos(n—v—4)t + 
rts 


; Therefore we obtain 
g®(n, t) = O(nt)-8, 


The lemma is thus proved. 
From the proof of Lemma 5, we can state the following result. 


Lemma 6. If —1<B <0, nt —1, then the relation 
(1/(B)n) (8—1)vsin(n—v)t = O(nt)* 
vtS1 


holds uniformly for 


11. Lemma 7%. Suppose that —1<—a«<B<0,1+q>a—£ and 
that 


(14) Z,(w) w)*14-4(d/dt) g®(n, #) dtdu. 
0 


Then forO0 wz, 
(15) | Zp(w)| (nw)-8(1 nw)-*, 


i 


he 


By 


nd 


Therefore we obtain 
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Proof. As in the proof of Lemma 2, we have 


Zp(m) = B(a, q—a+1) (d/dt) g®(n, t)dt = 0. 


Hence 
T a1 
(16) Zp(w) = — f (d/dt) g9(n,t) — 
w w/t 
= — t) dt. 


Write g®(n, 1) =g,(n,t) + g.(n, t) with 
[n/2] 
(B)ngi(n, t) = (B—1)vsin (n—v)t. 
y=0 


Firstly, assuming nw = 2, let us consider the following two integrals 


Zpi(w) = witea — w)*1t-1-49;(n, t) dt; (j= 1,2). 


By Lemma 6, we have 


2>w+i/n 


(17) f (t — w)*#-2-49, (n, t) dt 


w+l/n 

w*t-BO (n wy Pf (¢ == O(nw)-*8, 
w 

The second mean value theorem gives a number w, for which w < w, < 2uw, 

such that 


(t (n, t)dt = O(n) (nw)-* t) dt. 
g g 
wtis/n wi /n 


We have 


— cos (n — wr 
,t)dt = — 
Since ¢-? < n/2, we have 


And in virtue of B < 0, we have 


(B—-1)» cos (n —~y)t 


vt>1 


== O(n-t-8), 
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(18) wita-a (t — w)* (n, t)dt = O(nw) 
w+i/n 
Integrating by parts, we have 


"(t—w) (t) dt 


n/2 (8 —cos(n—v)t]*™ 
a-1f-1-q 
n/2 cos(n v)t dt 


a-14-1- 


= O(n--8) + — w) (nt) Pal, 


since, for ¢ > w, 


— (d/dt) {(t —w) = — +q—a)t— (1+ 
> a)w > 


Now, we have 


T 
f. — (d/dt) {(t — w) 


= 0(1) + _ — , 
since g-+2>a—8. It follows that 
(19) (t — (t) dt = O(nw) 


Collecting (17), (18) and (19), we obtain 
(20) Z p(w) = O(nw)-*F8, (nw = 1). 
We are now in a position to consider Zg'(w). We have 


wee —w) (n, t) dt 


J U 


0 


since the function (B)ng2(n,t) = S (8—1)n-vsinvt vanishes at 0 and z. 


Therefore we may write 
(21) Zp (w) = f t-4(dg./dt) f —u)*"*dudt 


w 1 
+ 
0 0 


ai 
| 
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The last term is a constant multiple of 


(22) (dg./at) dt = 9g2(n, w) = (B—1) sin vw 
= (1/(B)n)O(n**) max | sin w $riy sin2w-+----+ sin pw | 
= O(nw)", 


since (8 —1)n-, is increasing in vy. And it follows from the second mean 
value theorem that 


T b 
(23) (dg./dt) B(a, 1+ q—a) f (dg2/dt) dt 


Combining the results (20), (21), (22) and (23), we obtain 


(24) Zp(w) =Z,'(w) +Z(w) = O(nw)-*, (nw = 2). 


~ 


Secondly, let nw < 2. Using Lemma 5, we have 


22w 


w 
(25) (t — (n, t) dt = f (t — w)*t-?-90 (nw) dt 


w 


== —-w)*"0(nw)dt = O(nw). 


By Lemma 5, = O(nt)-8 fort >0. It follows that 


(26) wita- (t — w)*1t-1-1g8 (n, t)dt = O(n-8) f (t — w)**t-1-0-Bdt 
2w e/ 2w 
== O(nw)-8 f (v — dy = O(nw)-8, 


2 
00 


since the integral f (v—1)*'v7?-*8dv converges, by the hypothesis 


a—B<1+q. From (25) and (26) we have 

(27) Zp(w) = O(nw)-8 (nw = 2). 
The two relations (24) and (27) establishes the lemma. 
12. Proof of Theorem 4. By hypothesis, the functions 


1 


and ¢2-* are of bounded variation in (0,7). Hence the product 
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tup(u)du 


u)* —=T(1— a) (t%¢(t) )- -a 


(t) = 
is bounded in (0,7). It follows from Lemma 1 that 
(28) b(t) = ((t%p(t) )-a) at, 
except for a null set of points t. 


Write and for the n-th Cesaro mean of order of 
(2/m) fo (d/dt) sin ntdt and (2/z) (t) cos nt dt 
0 


respectively. An easy calculation gives 


on? On-1° n-*r,8. 


Assuming —1<—a<B<0. We have to show that & | n-*r,8 | con- 
verges. By (28), we have 


As in the proof of Theorem 2, this can be rewritten as 


It follows from Lemma 7 that 
/n 
|S | dy(w)| + Ant | 
0 1/n 


On account of —8>0 and —a—f'< 0, the convergence of the series 
=| n-*r,° | follows from Lemma 3. This establishes the theorem. 


Putting « = q = p, it has been shown that the function 


d ud(u)du 


iJ, G—uy  OSP<)) 


is of bounded variation in (0,7), if both ¢(¢) and t¢’(t) are of bounded 
variation in (0,7). It follows from Theorem 4 that the series & | n-*ry5| 1 
converges for 8 >—p. Hence the Fourier series is summable | C,a| for 
every «> —J1. We have now obtained the following 


4 

q 
4 
if 
i 

i 
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THEOREM 5, If $(t) and t¢’(t) are both of bounded variation in (0, 7) 
then the Fourier series of f, at the point x, is absolutely summable (C,«) for 
every «>—1. 


NATIONAL UNIVERSITY OF CHEKIANG, 
MEITAN, KweicHow, CHINA. 
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THE RADICAL AND SEMI-SIMPLICITY FOR ARBITRARY 
RINGS.* 


By N. Jacosson. 


The radical of an algebra with a finite basis, or, more generally, of a ring 
Y that satisfies the descending chain condition is defined to be the join of the 
nil right (left) ideals of 2%. The importance of the radical for the structure 
theory of these rings is due to the facts that 1) the radical Jt is a two-sided 
ideal whose difference ring 2{ — 9 is semi-simple in the sense that its radical 
is 0, and 2) the structure of semi-simple rings satisfying the descending chain 
condition can be subjected to a thorough analysis that leads in many important 
cases to a complete classification. Several investigations of nil ideals in arbi- 
trary rings have been made recently but none of these has led to a structure 
theory for general semi-simple rings.” This is one of a number of indications 
that in order to develop a satisfactory structure theory for arbitrary rings it 
is necessary to abandon the concept of a nil ideal in defining the radical. 

Other possibilities for defining a radical are afforded by. two important 
characterizations of the radical 3t of an algebra Mf with a finite basis. One of 
these, due to Perlis, makes use of the notion of quasi-regularity.* An element 
z of is right quasi-regular if there exists a in such that z+ zz’ = 0, 
Perlis has shown that z« 9 if and only if w+ z is right quasi-regular for all 
right quasi-regular u. A second characterization of 9 for algebras with an 
identity is that Jt is the intersection of the maximal right (left) ideals of 2.* 
A start in the investigation of the first characterization as a possibility for 
defining a radical for an arbitrary ring 2 was made by Baer, who showed that 
the totality # of elements z that generate right ideals containing only right 
quasi-regular elements is a right ideal.° 

The point of departure of the present investigation is the observation 


that KR is a two-sided ideal and that 8 coincides with the two-sided ideal 


* Received September 6, 1944; Revised January 5, 1945. 

1 Presented to the Society October 28, 1944. 

2 Baer [1] and Levitzki [1] and [2]. 

Perlis [1]. 

* Jacobson [1], p. 66. Cf. also G. Birkhoff [1]. 

5 Baer [1], p. 562. This definition of the radical has been independently proposed 
by Hille and Zorn who proved that §{ is a two-sided ideal and that if 9[ has an identity, 
§ is the intersection of the maximal right (left) ideals of 9{. These results were 
announced by Professor Hille in his Colloquium Lectures in August 1944. 
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defined in a similar manner using left quasi-regularity and left ideals. We 
define the radical of 2 to be the ideal 9. In the first part of the paper we 
establish a number of “radical-like” properties of Jt and we investigate # in 
several important special cases. For rings that satisfy the descending chain 
condition it = 3 where Jt is defined as above. If %& is a commutative normed 
ring # coincides with the radical as defined by Gelfand to be the totality of 
generalized nilpotent elements. A similar characterization holds for non- 
commutative normed rings. 

In the latter half of the paper we investigate the relation between ft and 
the intersection of the maximal right ideals of &. We show that if Y-R 
then = ILX for the maximal right ideals of and WJ = AR. If W has 
an identity, 3} = Ly. For arbitrary rings that contain maximal right ideals 
we show that }t is the intersection of certain two-sided ideals 8 whose difference 
rings are of a special type called primitive. 

This result implies that any semi-simple ring is isomorphic to a "lillit 
of the complete direct sum of primitive rings. To a certain extent this reduces 
the study of semi-simple rings to that of primitive rings. A tool for studying 
the latter is a representation theorem for these rings that states that a primi- 
tive ring is isomorphic to a dense ring of linear transformations in a vector 
space over a division ring. These theorems are analogues of the fundamental 
Wedderburn-Artin structure theorems for semi-simple rings that satisfy the 
descending chain condition. The Wedderburn-Artin theorems can be deduced 
quite simply from our results. Moreover, our results contain as special cases 
Stone’s theorem on Boolean rings and other known results on the representa- 
bility of rings as subrings of direct sums of fields.* We can also obtain from 
our theory a theory of algebraic algebras that is fairly conclusive for algebras 
with elements of bounded degree. These results will be published in a subsequent 
paper. 

I am indebted to R. Baer for a number of important suggestions that led 
to simplifications of some of the proofs and to extensions of several of the 
theorems from rings with an identity to arbitrary rings. 

1. Definition of the radical. Let Y be a ring with an identity and 
let z be an element of % such that 1-+-z has a right inverse wu. We write 
u=1+2’. Then (1+ 2)(1+ 2) =1 implies 
(1) a+ =0. 

Conversely if z is an element for which there exists an element 7 satisfying (1) 
then 1+ z has the right inverse 1-+ 2’. This leads to 


7 Stone [1], McCoy and Montgomery [1], McCoy [1] and [2]. 
®° Gelfand [1], p. 10. 
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Definition 1. An element z of an arbitrary ring Y is right quasi-regular 
if there exists an element 2 in % such that z+ 7 + 27 =0. The element z’ 
satisfying this equation is called a right quasi-inverse of z. 

We have noted that if % has an identity then z is right quasi-regular with 
right inverse 7 if, and only if, 1 +- z has the right inverse 1 + 7. 

If zis any element of % the totality {x + zz} of elements x + za where 
a ranges over YU is a right ideal. If z is right quasi-regular with right quasi- 
inverse then — + 22’ e{x-+ Hence {x + zz} and re {x + 22}. 
Then {z+ zz} On the other hand if {r+ zr} = then —z— 2’ 4+ 27 
for a suitable 7 and z is right quasi-regular. Hence we have the alternative 


Definition 1’. An element z of a ring %& is right quasi-regular if the 
totality of elements {x + zr} — %.° 

A right ideal X will be called quasi-regular if all the elements of ¥ are 
right quasi-regular. Let 3: and 32 be quasi-regular right ideals and let 2: € Qi, 
a1=1,2. There exists an element 2, such that z, + 2, + 2,2,;—0. Since 


/ 


Zo + 222’, € §2 this element has a right quasi-inverse w’ such that 


(22 + + w’ + (22 + 2021)” = 0. 


Hence 


(21 + 22) + (41 + + + (4 + (41 + w’ + Zw’) 
=[4at 71+ + (atari) + (a+ 


+l(at =0. 


Thus 7, w’ + is a right quasi-inverse of + 2. Hence + is 
quasi-regular. 

Now let 9% be the join of all the quasi-regular right ideals of Y. Since 
the right ideal generated by an element z is the totality of elements zi +- za 
where # is an integer and ae 2%, it is clear that # is the totality of elements z 
such that zi + za is right quasi-regular for all integral 1 and all a in Y. The 
above result shows that Rt is a right ideal. We wish to show that # is a two- 
sided ideal. Let ze Rt and be M. Then zbe# and there exists an element w’ 
such that zb + w’ + (zb)w’ =0. Then 


bz + (— bz — bu’z) + bz(— bz — bw’z) = — b(w’ + 2b + zbw’)z=0 


and so —(bz-+ bw’z) is a right quasi-inverse for bz. Similarly if 1 is an 
integer and (bz)i + (bz)a = b(zi za) is right quasi-regular. Hence 


bzeR and we have proved 


® This definition of quasi-regularity is due to Baer. 
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THEOREM 1. Jf Mis an arbitrary ring the join ® of all the quasi-regular 
right ideals of I is a (right) quasi-regular two-sided ideal. 


Definition 2. The radical of a ring is the join of all the quasi-regular 
right ideals of the ring. 

We have seen that ® is the set of elements z such that zi + za is right 
quasi-regular for all integers 7 and all a in YM. If Wf is a ring with an identity, 
the connection between quasi-regularity and regularity shows that # is the 
totality of elements z such that 1 + za has a right inverse for every a in %. 

A similar discussion holds for left ideals. We define left quasi-regularity, 
left quasi-inverse, quasi-regular left ideal and left radical 9’ in a manner 
analogous to the above. As for ordinary inverses we say that an element is 
guasi-regular if it is both right and left quasi-regular. 


LeMMA 1. If z is quasi-regular any right (left) quast-inverse is a left 
(right) quasi-inverse, is uniquely determined and commutes with z. 
Let @ be a right quasi-inverse and 2” a left quasi-inverse. Then 


and 


+ (z+ 27+ 22) (z2+7+27) 
(24-7472) 4 =7. 


This proves the first two statements. Since 
72, == 22, 
We call 2 the quasi-inverse of 2. 


Now let ze %t. Then z has a right quasi-inverse —=—z— zz’. Since 
Rt is an ideal 2 «Rt. Hence 2 has a right quasi-inverse. Since z is a left quasi- 
inverse of 2 it follows by Lemma 1 that 7 is quasi-regular and z is its quasi- 
inverse. Hence z is quasi-regular. Since #t is a left ideal # S the left radical H’. 


By symmetry = This proves 


THEOREM 2. The radical of a ring Y is the join of all the quasi-regular 
left ideals of %. 


By the Lemma the elements of 9t are quasi-regular. 
2. Elementary properties of the radical. If %f is an arbitrary ring 


we know that we can imbed & in a ring with an identity M%* such that 
Y* — 9+ (1), WMA (1) =O where (1), the ring generated by 1, is iso- 
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morphic to the ring of integers.® At first we assume only that 2* = % +- (1). 
Let be the radical of and let ze R(A*) A W. Then z has a quasi- 
inverse 2’ in Since = —z— zz’, and z is quasi-regular in YW. 
Hence (R(U*) AM) R(W). Since (1) any right ideal of W 
is a right ideal of Hence R(M) = Thus R(W) — R(A*) A W. 

Suppose now that % A (1) =0 and that (1) is isomorphic to the ring 
of integers. Then if z*«%#(%*) the coset 2* of z* in the difference ring 
%* — Y is in the radical of this ring. Since the radical of the ring of integers 
is 0, 2*=0 and z*e%. Hence = R(U*). From the equation 


—R(X*) A we obtain R(X) — R(A*). 


THEOREM 3. Let YU be an arbitrary ring and let U* be a ring with an 
identity containing such that A*—A-+ (1). Then the radical 
= R(M*) AW. If in addition UA (1) —0 and (1) is isomorphic to the 
ring of integers then R(N) = K(A*). 


We shall call & a radical ring if U=—R. If R—O0, W is semi-simple. 
For many problems the following theorem effects a reduction to the con- 


sideration of these two extreme types of rings. 


THEOREM 4, If ft is the radical of UX, U— W— RK ts semi-simple. 


Let Z be an element of the radical of 2 and let z be an element in the 
coset Z. Then there exists an element 7 such that z+ 27 + 27 —=uet. Also 


there exists a uw’ such that u+ w+ uv’ =0. Hence 


O= (2+ 27) 
+2). 


n-1 
If z is a nilpotent element of index n, 7 = > (—1)‘z* is a quasi-inverse 
1 


of z. Hence we have the following 


TurorEM 5. The radical of a ring contains every nil right (left) ideal 
of the ring. 


As a consequence of Theorems 4 and 5 we may prove the 


Corotuary. If z is an element such that then ze R. 


® Albert [1], p. 22. 


Thus z is right quasi-regular. Since the totality of elements z in the cosets 2 
of #(2) is an ideal, this totality is a quasi-regular ideal. Hence ze ft and 
Z == (), 
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For if § is the right ideal generated by z, 3°? U2M=R. Hence 
(3 + #) — is nilpotent in the semi-simple ring X—A—NR. Hence 
=0, JSR and ze KR. 

This corollary implies that ze Rt if and only if za (az) is right (left) 
quasi-regular for all a in %. 


C2) 


The elements of the radical need not be nilpotent. This can be seen in 
the example: % the ring of p-adic integers. By applying the definition directly 
or by using Corollary 2 to Theorem 18 we can show that kt = pM. However, 
no element of I is nilpotent. Some information on the nature of the elements 
of ® can be obtained from the following 


THEOREM 6. Jf Jt is a subring of R and zeN, then for any positive 
integer h either > or 


Evidently = Suppose that Then = for 


some y in Jt. Let vy be a quasi-inverse of —y. Then 
= zh — zhy + — = y+ yy) = 2. 
YY 


This theorem implies that the radical contains no idempotent element 
~ 0. It is known that if % is a ring with an identity whose lattice of right 
ideals is completely reducible, then every right ideal in & has the form 


~ 10 


eM where e is an idempotent element in Hence we have the following 


THEOREM 7%. If is a ring with an identity whose lattice of right (left) 
ideals 1s completely reducible, then UX is semi-simple. 


We shall show next that any ring which is regular * in the sense of von 
Neumann is semi-simple. We recall the definition: % is regular if every ele- 
ment a of % has a relative inverse u such that aua—=a. Suppose that ae ft. 
Then — wa has a quasi-inverse v such that —ua-+v—wuav=—0. Hence 


0 = — aua + av — auav = — a -+- av av >= — 
THEOREM 8. Any regular ring is semi-simple. 


3. The radical of a ring satisfying the descending chain condition. 
We suppose now that %f is a ring for which the descending chain condition for 
right (left) ideals holds. Let 9t be a two-sided ideal contained in # and sup- 
pose that 2? — MN. Then if 9540 there exists a minimum right ideal Y of W 
with the properties 1) SM, 2) 


10 See, for example, Jacobson [1], p. 65. 
11: Von Neumann [1]. 
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Let 6 be an element of such that Then (bR)R— 
and since b3¢t = we have by the minimality of J. Since there 
is an element y in Jt such that by 6. As in the proof of Theorem 6 this 
leads to b = 0 contrary to D540. Thus —0. Now the positive integral 
powers #i* of # are two-sided ideals andR=R*?=---. Hence there is an 
integer p such that Then for 3 MR? we have N?—MN. Hence 
—0. This proves 


THEOREM 9. If WU is a ring that satisfies the descending chain condition 
for right (left) ideals, then the radical of X is nilpotent. 


Since any nil ideal is contained in the radical this proves that any nil 
ideal in a ring that satisfies the descending chain condition for right (left) 
ideals is nilpotent. It is clear also that 9 coincides with the usual radical 
defined as the join of all nilpotent ideals. 


4. Finitely generated ideals contained in {. 


THEOREM 10. If 9 is a right ideal with a finite basis contained in the 
radical KR then either NR < or M—O0. 


Since Jt is a right ideal either NR<M or RR —M. Assume that 
MR Let +, yn be a basis for M. -Then every element of has 
the form Syiai + Syiji where the a; and the 7; are integers. Since RM 
every element of Jt also has the form yizi, 2; in Rt. In particular y; = Syizi. 
Let 2’; be the quasi-inverse of —z,. Then we have 


— 91%) + (% — 121) (2; + Zj21). 


Hence y; can be eliminated from the basis. Similarly every y; can be eliminated 
and so 0. 

If % is an arbitrary ring we define the transfinite powers M* by the 
conditions 1) = M, 2) — 3) if a is the limit ordinal is the 
join of all 2 with B< a. There exists a least ordinal p such that 2? = 1". 
We shall call p the index of & and we shall say that %& is transfinite nilpotent 
if 240. Now suppose that Y& is a ring that satisfies the ascending chain 
condition for right ideals. We recall that this condition is equivalent to the 
requirement that every ideal has a finite basis. Let Jt =? where p is the 
. index of the radical of Then MI — M and so, by Theorem 10, — R? 0. 


THEOREM 11. The radical of a ring that satisfies the ascending chain 
condition for right ideals is transfinite nilpotent. 
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5. The radical of a matrix ring. If %M is an arbitrary ring we denote 
as usual the ring of n X m matrices with elements in %& by %n. If B is a sub- 
ring (ideal) in %&f then By is a subring (ideal) in Mn. Let 9 be the radical 
of 2. Then we wish to show that 9n is the radical R(An) of Wn. 


LEMMA 2. Any matrix z= (2ij) of Un in which a1 is right quasi- 
regular and the zi; =0 fori > 1 is right quasi-regular in Mn. 


Since 2, is right quasi-regular we know that the ideal {x + 2,7} = YW. 
Hence there exist elements 21; such that 214 + 212/11 =— 24. Then if we 
set 2’ij = 0 for i > 1 we may verify directly that 7 = (7i;) is a right quasi- 
inverse of 2. 

Consider now the totality 3: of matrices with first row consisting of 
elements in # and other rows 0. Then %: is a right ideal. Hence by the 
lemma 9: S W(Y%n). Similarly the totality 3; of matrices with j-th row con- 
sisting of elements of ft and other rows 0 is contained in #(W%n). Since 


Conversely let Y = (yij) «Rt (M,). If a is any element of 2 we denote 

the matrix that has @ in the (1,7) position and 0’s elsewhere by Aij. Let a 

and 6 be arbitrary and form the matrix D=)> AmYBa. Then D is the 
k 


diagonal matrix @Ypqb} and De R(Un). If D’ = (dij) is a right 
auasi-inverse of D it is easy to see that d’ = d’,, is a right quasi-inverse of 
d = dYpqb. Evidently this implies that for arbitrary c in YM, and arbitrary 
integral i, de + di is right quasi-regular. Hence det. Since a and 6b are 
arbitrary the Corollary to Theorem 5 shows that ypqe St and Y € Rin. 


THEOREM 12. Jf YW is an arbitrary ring and Un is the ring of nX n 
y 
matrices with elements in XM, then the radical the radical of 


6. The radical of an algebra. Let 2M be an algebra of possibly infinite 
order over a field ®. By an ideal in the algebra 2f we mean, of course, an ideal 
of the ring Y& that is invariant under the scalar multiplications > ra, « in ®. 
If & has an identity, za = w(1a«) = (1a)z and so any ideal of the ring % is 
an ideal of the algebra Y. 

The above discussion is valid without change for an arbitrary algebra 2. 
Thus the radical # of 2f can be defined to be the join of all the quasi-regular 
right ideals of 2. 9 is also the maximum quasi-regular left ideal of Mf. All of 
our theorems hold for algebras. 

An element a of an algebra % is algebraic if it satisfies a non-trivial 
algebraic equation with coefficients in the underlying field ®. An equivalent 
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condition is that a generates a subalgebra A with a finite basis. As in the 
special case of a field, an element which is not algebraic will be called 
transcendental. If every element of I is algebraic, then %&f is algebraic. If a 
is an algebraic element and A is the subalgebra generated by a then there 
exists a positive integer h such that a’”*A = a*A.. Hence if a is in the radical, 
ly Theorem 6, a is nilpotent. This proves 


THEOREM 13. If Y& is an algebra over a field, the elements of the radical 
R of W are either nilpotent or transcendental over ®. 
If Mf is algebraic every z in %t is nilpotent. Hence, since # contains every 


nil ideal, we have the following 


THEOREM 14. The radical of an algebraic algebra U is the join of all 
nil right (left) ideals of YU. 

If 2% is commutative, an element z generates a nil ideal if and only if it 
is nilpotent. Hence we have the 


CoroLuARy. If is a commutative algebraic algebra, the radical of 
1s the totality of its nilpotent elements. 


7. The radical of a normed ring. We suppose now that %& is a normed 
ting, i.e., & is an algebra over the field ® of complex numbers and for each 
a in & there is defined a non-negative real norm || a || such that 


1. |lal| >0 if | 0 | = 0 

2 

3. if ae® 

4, iol 

5. has an identity and || 1 | —1 

6. is complete relative to the metric D(a,b) = ||a—6d || 


Following Gelfand we call an element of Y& a generalized nilpotent ele- 
ment if lim || 2 ||'/*—0.'* For commutative normed rings Gelfand has 
defined the radical to be the totality of generalized nilpotent elements. We 
shall show that this set coincides with the radical as defined here and we shall 
obtain a similar characterization of the radical for non-commutative normed 
rings. We prove first the following 

THEOREM 15. The radical of a normed ring YX is the totality of elements 
z such that (za)"—>0((az)"— 0) for every a in %. 


12 Gelfand [1], p. 10. 
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Since Wf has an identity # is the totality of elements z such that 1+ za 
has an inverse for every a in %&. Now suppose that z is an element such that 
(za)"—>0 for every a. Then for any @ in ® || (z«)" || <1 for n sufficiently 
large. Thus || 2" || < 8" where B=1/|a|. We choose @ so that || >1. 
The series 1—z-+2*—- ~:~ is ultimately dominated by the convergent 
series 1+ 8+ 8°+---. Hence 1—z-+ exists in and this 
element is the inverse of 1+ 2. Similarly we can show that if 2’ = za, then 
i+ has an inverse. Hence Conversely suppose that ze Rt. Then 
1+ za has an inverse (1+ za)-+ for every « in ®. Using the fact that 
(1 + za) is an analytic function of # we may prove, exactly as Gelfand has 
done in the commutative case, that (z«)"—>0."* In particular z"—>0 and 
since Jt is an ideal (za)"—>0 for every a. 

We shall call an ideal a generalized nil ideal if all of its elements are 
generalized nilpotent elements. If ze Rt, (z«)"—>0. Then for n sufficiently 
large ||(za)" || <1 and || 2"|| << B" for B=1/|a|. Hence OS] |?" < 
Since f is arbitrary lim || z”||*/"—0. Thus z is a generalized nilpotent ele- 
ment and t is a generalized nil ideal. Next let § be an arbitrary generalized 
nil right ideal. Then if ye %, || y” Hence y"—>0. Since is a right 
ideal, y’ = yae § and (y/)"— 0. By Theorem 15, ye and so $M. This 
proves 


THEOREM 16. The radical of a normed ring is a generalized nil ideal 
that contains every generalized nil right (left) ideal of the ring. 


Let 9 be commutative. Then if z and ae YW, 
\(za)" |] =f rar |S jar 


Hence ||(za)” ||*/"S || 2” - || || and if z is a generalized nilpotent ele- 
ment then za is a generalized nilpotent element. Thus any generalized nil- . 
potent element generates a generalized nil ideal and §t is the totality of gen- 


eralized nilpotent elements. 


8. Quotient ideals. We return to the consideration of an arbitrary 
ring Y. The results that we obtain are also valid for algebras but we shall 
not state them explicitly for these. 

Let 3 be a right ideal in YM. Then if ae WM the right multiplication 
a—> «xa determined by @ induces an endomorphism @ in the difference group 
— The mapping sends the coset x + into za+ The totality 


of elements @ is a subring Wf of the ring of endomorphisms of Mt and the 


13 Gelfand [1], p. 10. 
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correspondence a—>4@ is a homomorphism between % and %. The kernel of 
this homomorphism is a two-sided ideal 3: YM which we shall call the quotient 
of relative to U. Evidently S and if has an identity, YW) 
is the largest two-sided ideal of 2 contained in 3. By the fundamental 
theorem on homomorphisms % = % — (3: 2). 

Let § be maximal, i. e., 2 > YJ and there is no right ideal 3’ such that 
Then PM is irreducible relative to %. As usual we 
call a ring of endomorphisms % irreducible if the group Mt in which % acts 


is irreducible. Let % = 0 have this property. Then the totality 8 of elements 


z in M such that 2% —0 is a subgroup of Mt invariant under M. Hence either | 


83 —=—0or 83—M. Since 8AM and so 8—0. It follows that if 
is any element ~0 of Mt then aU ~0. Since 2M is a subgroup invariant 
under %, 2% — Mt. We use this to prove 


THEOREM 17. Any irreducible ring of endomorphisms is semi-simple. 


Let Z be an element of the radical of 2 and let =~ 0! be arbitrary in M. 
If —M. Hence there is an in % such that The 
element — 2é has a quasi-inverse 2’. Hence 


— — + = + (x — —0. 


This contradiction shows that «2 —0 for all z. Thus 70 and % is semi- 
simple. 
This theorem has the following important consequence. 


Corotnary. If § is a maximal right ideal (3:2) contains the radical 
Ht of W. 

If (3: there is nothing to prove. Hence suppose that (3: ~ 
Then = ($:%) is an irreducible ring of endomorphisms ~ 0. If 


the coset Z = z + (9: %) is in the radical of &— (3: MW). Since is semi- 
simple, Z=0. Hence ze and 


9. The radical as intersection of maximal right ideals. Let 2 be any 
ring that is not a radical ring. Then % contains an element a which is not 
right quasi-regular. Hence the right ideal {~-+ ar} does not contain 4. 
Moreover, if & is a right ideal that contains a and contains the ideal {xz + az} 
then § = 2%. By using Zorn’s maximum principle we may prove 


Lemma 3. If a is an element of UX that is not right quasi-regular the 
right ideal {x + ax} can be imbedded in a maximal right ideal, 
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This shows, of course, that any ring that is not.a radical ring contains 
maximal right ideals, We assume now that %& is any ring that contains 
maximal right ideals, Consider the intersection IT of the maximal right 
ideals 3 of UM. Let ye IL¥. Then y is right quasi-regular. For otherwise 
{x -+- yx} can be imbedded in the maximal right ideal §. Then ye% and 
3 = UW contrary to the maximality of 3. Thus every element of IQ is quasi- 
regular and since IL¥ is a right ideal, INKY = %. On the other hand, by the 
corollary to Theorem 17, RS(3:M). Hence and 
UR SITY. This proves 


THEOREM 18. If YU is a ring that contains maximal right ideals and 113 
is the intersection of these maximal right ideals, then II3 SR and AR S IIS. 


Corottary 1. Jf YU is not a radical ring and IL is the intersection of 
the maximal right ideals of UL then ISSR and AR S IY. 


CoroLiary 2. If UW is a ring with an identity the radical of U is the 
intersection of the maximal right ideals of %. 


For W—R. Hence R= as well as 

The following results also are consequences of Theorem 18: 

If Wf is a ring that contains maximal right ideals then IL§ is a two-sided 
ideal. 

For (ILS) S URS MY. Hence IQ is a left ideal as well as a right 
ideal. 

The radical of a normed ring is a closed ideal. 

It is known that any maximal ideal 3 is closed.4* Hence # = WY is 
closed. 

If 2 is a semi-simple ring, IW} 0. Suppose, in addition, that Y satisfies 
the descending chain condition for right ideals. Then we can find a finite 
number of maximal right ideals such that 13; —0. We 


may suppose that the set {%;} is minimal in the sense that [] %=W’i30 
kAi 

for every i. Then Yi A Si = 0 and Yi SH Bi. Since Ji is maximal it follows 

that = Ji + Vi. Hence is isomorphic to the difference U-group J, 

and SY; is minimal. Using a simple lattice-theoretic argument we can conclude 


that BW This proves the well-known 


THEOREM 19. Jf %& is a semi-simple ring that satisfies the descending 
chain condition for right ideals, X is a direct sum of a finite number of mini- 
mal right ideals. 


*Gelfand [1], p. 8. 18 Jacobson [1], p. 35. 
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Suppose again that % is any ring that contains maximal right ideals, 
Then, by the corollary to Theorem 17, RS M(S:M) for all maximal &, 
Conversely let ye Then Hence AyM = and this 
implies that ye 9t. We have therefore proved 


THEOREM 20. Let % be an arbitrary ring that contains maximal right 
ideals. Then the radical of is the intersection where ranges over 
the maximal right ideals of 2%. 


If W is not a radical ring, R= A) where ranges 
over the maximal right ideals. 


The results of this section hold also for left ideals. An interesting conse- 
quence of the second corollary to Theorem 18 is that if & is a ring with an 
identity then the intersection of the maximal right ideals of % coincides with 
the intersection of the maximal left ideals of Wf. 


10. Primitive rings. 


Definition 3. A ring & is primitive if MI contains a maximal right ideal 
whose quotient (3: = 0. 

Let % be of this type and as before let % denote the ring of endo- 
morphisms in Then W is irreducible and 
= — Thus any primitive ring is isomorphic to an irreducible 
ring of endomorphisms. 

Conversely suppose that % + 0 is an irreducible ring of endomorphisms 
acting in Mt. Let x be an element 0 in Mt and let Yz denote the totality of 


elements of such that 4) = 0. Evidently is a right ideal in Wef 


have seen that — Hence 32 < YW. Let be an element of not in 


and let é be arbitrary. Then za+40. Hence (va) % = M and so there exists 
an element such that Thus and _ Xo + aM for 
any @ not in Ye. This proves that Bz is maximal. If €4 06% there is a 
in Mt such that ye40. There exists an @ such that zi = y. It follows that 
~ 0. Thus for any 0 there exists an @ such that dé On the other 


hand, if Ge (So: M), for all d. Hence and (ez: 1) = 0. 


THEOREM 21. A necessary and sufficient condition that XI be primitiwe 
is that X be isomorphic to an irreducible ring of endomorphisms. 


If B is a two-sided ideal 40 in the irreducible ring %, the argument 
that we have used before for % shows that 2@ —M for any c0. Thus ¥ 


is irreducible. Hence we have the following 
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THEOREM 22. Any two-sided ideal in a primitive ring is primitive. 


For a deeper study of primitive rings we require a theorem on irreducible 
rings of endomorphisms recently proved by the author. We recall first that if 
% is irreducible in Yt then the totality of endomorphisms of Yt that commute 
with all the de & is a division subring D of the ring of endomorphisms of Yt.?® 
Since contains the identity mapping, we may regard MM as a vector space 
over D. 

We call a set & of linear transformations in a vector space Yt over D a 
dense set if for any finite set 2,: - -,@n of D-independent vectors and any 
finite set ¥:,° °°, Ym of arbitrary vectors there exists a linear transformation 
je % such that x —=y;. An equivalent condition is that if © is any finite 
dimensional subspace of Yt and A is any linear transformation in © over 9, 
then A can be extended to a linear transformation d« 2 of M over D.17. Hence 
if Mt is finite dimensional 2% — & the complete ring of linear transformations 
in Yt over D. We have proved elsewhere the following 

THEOREM 23. If 0 is an irreducible ring of endomorphisms in the 
commutative group Wt and D is the division ring of endomorphisms commu- 
tative with the Ge U, then A is a dense ring of linear transformations in the 
vector space Wt over D. , 


The converse is trivial. Thus the concepts of primitive ring, irreducible 
ring of endomorphisms and dense ring of linear transformations may be used 
interchangeably. 

If is a dense ring of linear transformations in Mt over D and 2, 3, ° 
are D-independent, then 


> (Ga, A > (Sa A Sn A Sa) 


where $2 denotes the right ideal of annihilators of z. Hence if Yt is infinite 
dimensional, then % does not satisfy the descending chain condition for right 
ideals. Thus if 2 satisfies the descending chain condition M is finite dimen- 
sional over D and therefore 9% = the complete ring of linear transformations 
in Mt over D. This proves 


THEOREM 24. Any primitive ring that satisfies the descending cham 
condition for right ideals is isomorphic to the complete ring of linear trans- 
formations of a suitable finite dimensional vector space over a division ring. 


16 This is Schur’s lemma. See Jacobson [1], p. 57. 
17 For the definitions and results on dense rings of linear transformations quoted 
here, see Jacobson [2]. 


| 
313 

als, 

this 

ight 
over § 


314 N. JACOBSON. 


Since the complete ring of linear transformations in a finite dimensional 
vector space is simple we have the 


CoROLLARY. Any primitive ring that satisfies the descending chain con- 
dition for right ideals is simple. 


11. Structure of semi-simple rings. We have seen that if & is not a 
radical ring, the radical St = II(: %) for the maximal right ideals 3. The 
difference rings %—(%:%) are isomorphic to irreducible rings of endo- 
morphisms. Hence they are primitive. In particular we have the following 


THEOREM 25. If & is a semi-simple ring XU contains two-sided ideals B 
such that 1) each UX — B is primitive and 2) 1B = 0. 


The converse holds also. For if 8 is a two-sided ideal in a ring and 
%— %B is semi-simple then 8 = Ht. Since any primitive ring is semi-simple 
each ideal 8 such that 2%—% is primitive contains ft. Hence if 1B —0 
Ft 0. 


Theorem 25 reduces, to a certain extent, the study of semi-simple rings 


to that of primitive rings and hence to that of dense rings of linear trans- J 


tormations'in a vector space over a division ring. This will be clearer in the 
next section where we derive a certain alternate form of this theorem. Before 
doing this we obtain some consequences of the theorem in the present form. 
We note first 


THEOREM 26. Any two-sided ideal in a semi-simple ring 1s semi-simple. 


Let 2, be a two-sided ideal in the semi-simple ring Mf and let {B} be a set f 


of two-sided ideals in 2 such that 1) each 2—¥ is primitive and 2) 18 —0. 


Let 8, = U, AB. Then W,— (UW, A B) = (%, + B) — B a two-sided ideal 
in the primitive ring & —%B. It follows that %,—%, is primitive. Evidently} 


118 = 0. Hence is semi-simple. 


Next let Y& be a’semi-simple ring that satisfies the descending chain con- 
dition for right ideals. Let {8} be a set of two-sided ideals satisfying thef 


conditions of Theorem 25. We may replace the set {8} by a finite set By 


i=1,---,s, and we may suppose that the set is minimal in the sense thatf 


0. Since is primitive and satisfies the descending 


chain condition for right ideals, 21’ —%B is simple and so B is a maximal two} 
sided ideal in It follows that Hence =U —B; 


simple. As for one-sided ideals we can prove that U=B’,@--- @ Bal 
This proves the Wedderburn-Artin structure theorem: 
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THEOREM 27. Any semi-simple ring that satisfies the descending chain 
condition for right ideals is isomorphic to a direct sum of a finite number of 
simple rings. 


By the preceding section the structure of the component ring 8’; is that 
of a ring of linear transformations in a finite dimensional vector space over a 
division ring. This is the second fundamental Wedderburn-Artin structure 
theorem. 


12. Infinite direct sums. Let S be an arbitrary set. We call the ele- 
ments of S points and we suppose that to each point P there is associated a 
ring Ep. Now let S = (S, Ep) be the totality of functions f(P) with domain 
S and with f(P) in Gp. If f(P) and g(P) «S we define f + g by (f+ 9) (P) 
=f(P) + 9(P) and we define fg(P) =f(P)g(P). Then it is readily veri- 
fed that S is a ring. We shall call it the complete direct sum of the rings Ep. 
Let So be the subset of functions f(P) such that f(P) = 0 for all but a finite 
number of the points P. Then Sp» is a subring of S. We call it the (discrete) 
direct sum of the rings Ep. Let Eg be the subset of So of elements f(P) 
such that f(P) =0 for all PQ. Then 1) Gg is a two-sided ideal in So, 
2) S, = 3Ee where Eq denotes the totality of finite sums of the elements 
of the ideals Eg and 3) Eg AZ Er —0. Conversely these conditions are 


sufficient that a ring be isomorphic to a direct sum of rings isomorphic to the 
ideals Eg. 

If & is a subring of the complete direct sum S we define the component 
Wp of M at P to be the totality of elements f(P) of Ep for f in M. It is clear 
that Mp is a subring of €p and the correspondence f—f(P) is a homo- 
morphism between % and Mp. The kernel of this homomorphism is the two- 
sided ideal Bp of elements g such that g(P) =0. Hence 18p—0. Moreover, 
Wp = — Bp. 

We suppose, conversely, that S is a set of two-sided ideals P= % such 


| that TB—0. Let Tp = M— and form the complete direct sum © —(S, Mp). 


Then each a in & determines an element @ in © such that 4(P) is the coset 
a+. The set & of the @ is a subring of S and the correspondence a—>4@ 
isa homomorphism. It is clear that component %p = Wp and that 4(P) = 0 
if and only if ae¥. Thus d=0 if and only if ae 1B and since 1B—0 
we see that the correspondence a—> 4d is, in fact, an isomorphism between 
and %. Using these remarks we may formulate Theorem 25 and its converse 
as follows: 


THEOREM 28. A necessary and sufficient condition that a ring UL be semi- 
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simple is that U be isomorphic to a subring XU of a complete direct sum (S, lp) 


where the components Up of X are primitive. 


This theorem serves as a substitute in the general theory for the 
Wedderburn-Artin Theorem 27. A number of special cases of the theorem 
are well-known. We cite, for example, Stone’s theorem on the representability 
of a Boolean ring as a subring of a direct sum of the fields Y2 of residues 
mod 2 and Montgomery and McCoy’s generalization of this theorem to commu- 
tative rings in which every element a satisfies the equations pa = 0, a? =a 
for a fixed prime p. These results follow from Theorem 28 and the easily 
proved fact that any primitive commutative ring is a field. 


13. Primitive rings that contain minimal ideals. If & is an arbitrary 
ring it is well-known that any minimal right ideal § in %& either has the form 
where e is an idempotent element or If is a minimal right 
ideal and a is arbitrary aj is either 0 or a minimal right ideal. It follows 
that the join 3% of all the minimal right ideals is a two-sided ideal.’® 

If & is an irreducible ring of endomorphisms in the commutative group 
M and B is a two-sided ideal 0 in M we have seen that 2B — M for every 
t~0in MN. B is primitive also. If 8, and B, are two-sided ideals = 0 then 
the equation 2B, = Mt implies that B,B.~0. We state this as 


Lemma 4. If B,, B. are two-sided ideals ~ 0 in the primitive ring %, 
0. 


This, of course, implies that 8, A 8.0. 

Suppose now that 2 is a primitive ring that contains minimal right ideals, 
Since is semi-simple any minimal right ideal of has the form 
e* Let be the two-sided ideal for all minimal right ideals 
We assert that % is a minimal two-sided ideal. For let %, be a two-sided ideal 
such that 3: < %. There exists a minimal right ideal 3 %:. Since § A &h 
is a right ideal and 3 is minimal it follows that 3 A 3: —=0. Hence also 
S31 = 0. Since is not nilpotent AX ~0. Evidently AY is a two-sided 


ideal and since 0, 3:1 by the lemma. The minimality of jf 
has therefore been established. It follows from Lemma 4 that % is contained f 


in every two-sided ideal 8 0 of %. We wish to show next that % is a simple 
ring. Let $1 < % now denote a two-sided ideal of Then S <3 


and 199 is a two-sided ideal of 2. Hence §%:% = 0 and since % is primitivey 


251 = 0. 


18 See, for example, Jacobson [1], p. 64. 


2° This ideal has been called the anti-radical by Baer. See Baer [1], p. 544. 
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THEOREM 29. Let WU be a primitive ring that contains minimal right 
ideals and let § = =X the join of all the minimal right ideals of U. Then & 
1s a. two-sided ideal that is contained in every two-sided ideal ~0 of U and § 
1s a simple ring. 

If B is a two-sided ideal ~ 0 of Mf, B contains every minimal right ideal 
x of YM. It is easy to see by the above reasoning that % is minimal in &. 
Hence 8% satisfies the same conditions as YI. 

Again let % be an irreducible ring of endomorphisms in a commutative 
group Mt and let D be the division ring of endomorphisms in Yt commutative 
with the elements of %. We call a linear transformation A in a vector space 
M over D finite valued if the image space YA is finite dimensional. We have 
shown elsewhere that if % contains finite valued linear transformations <~0 
then % contains minimal right ideals.*° Conversely suppose that % contains a 
minimal right ideal §. Then § = Mf; 2—é40. We wish to show that 
Mé is one dimensional. For, if not, there exist two D-independent elements 
z,y of M such that x, y. By the density of there exists a b in 
such that 2b = 0, yb 40. Then satisfies —0, and %. 
Hence if 32 denotes the annihilator of Since is minimal 
But this contradicts the fact that and 0. 

We have shown also that if % contains finite valued transformations ~0 
the totality § of these transformations is a two-sided ideal contained in every 
two-sided ideal £0 of Thus for all minimal right ideals %. 
This completes the proof of the following 


THEOREM 30. Let &=0 be an irreducible ring of endomorphisms in 
M and let D be the division ring of endomorphisms commutative with the 
elements of X. Then X contains minimal right ideals if and only tf it contains 
finite valued linear transformations ~0 of Mt over D. If the condition is 
satisfied the minimal two-sided ideal % coincides with the totality of finite 
valued transformations in %. 


We recall that a homomorphism a — @ between an abstract ring M and a 
ting of endomorphisms % in a commutative group M is called a representation 
of %. The representation is irreducible if U is irreducible. Two representations 
ure equivalent if the groups in which the endomorphisms act are %{-isomorphic. 
We have seen that the fundamental fact about a primitive ring is that it 
possesses a (1 — 1) irreducible representation. 


2° Jacobson [2], p. 232. *1 Jacobson [2], p. 230. 
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Suppose now that %& is a primitive ring that contains a minimal right 
ideal § and let a—>@ be any (1—1) irreducible representation of %. Let 
be the corresponding ring of endomorphisms and Yt the group in which they 
act. The image & of <%$ is +0. Hence there exists an z in Yt such that 
2340. Because of the irreducibility of M, z¥—M. It can be verified 
directly that the correspondence u— xu, u in &, is an Y-isomorphism between 
~ and Yt. Thus the representation in Yt is equivalent to the representation 
by the right multiplications in 3. This proves 

THEOREM 31. If YW is a primitive ring that contains minimal right 
ideals, any two (1—1) irreducible representations of X are equivalent. 


Now let %, and %2 be dense rings of linear transformations in Mt, over 
D, and M. over D2 respectively. Suppose that the M; are isomorphic under 
a correspondence @, —> dz and that the %; contain minimal right ideals. Then 
if M% is the abstract primitive ring isomorphic to the Mf; we have two irre- 
ducible representations a—d,, a—> 4d, of M. Since these representations are 
equivalent we have a (1—1) correspondence S : between 


Mt, and Pt. such that 
(a: + 9:)S = 2,5 + 9,8, (2,4,)S = (2,8) a>. 


The latter equation may be written in the form a, = S-4,S. Now it is easy 
to see that D; is the complete set of endomorphisms in Yt; commutative with 

i.” Hence the correspondence a, for in D, is an iso- 
morphism between D, and D2. This shows that the two vector spaces are 
isomorphic. If we identify these spaces, setting Mt; — M, Di — D, then 8 
becomes a semi-linear transformation in Mt over D. 


THEOREM 32. Two dense rings of linear transformations that contain 
minimal right ideals can not be isomorphic unless the spaces in which they 
act are isomorphic. If U, and U. are two dense rings in M over D and the 1; 
contain minimal right ideals and are isomorphic under a correspondence 
G, — a, then there exists a (1—1) semi-linear transformation S in M over D 
such that dz = S-*a,8 for all a. 


This result shows that for primitive rings that contain minimal right 
ideals we have precisely the same uniqueness of representation as a dense ring 
of linear transformations as we have in the classical case of simple rings that 
satisfy the descending chain condition. 


22 Jacobson [2], p. 233. 
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14, Atomic semi-simple rings. Let 2% be any ring that contains mini- 
mal right ideals and let % be the join S% of all the minimal right ideals J of YW. 
The right multiplications w— ua in § form an irreducible ring and therefore 
a semi-simple ring. It follows that if ® is the radical of & then SR —0. 
Hence 0. 

We shall call a ring & atomic if MU =F. If W is atomic W is semi-simple 
if and only if 2 contains no nilpotent ideals. Let % be of this type and let 3 
be a minimal right ideal in &. Then we have the following 


Lemma 5. If & is atomic and semi-simple and % is a minimal right 
ideal in U then UY = J and AZ is a simple ring. 


If B is any two-sided ideal of either B= YJ or FA 
Now if = 0, =0 contrary to the semi-simplicity. Hence AY = 
If B is any two-sided ideal of Y& properly contained in AY, BY. Hence 
3B —=0, and Hence B=0. Thus AX is a minimal 
two-sided ideal of &. It follows readily that MY is a simple ring. 

Let {2.2} be the set of distinct two-sided ideals NY, S$ minimal. Each 


is simple. Hence if «AB, WaM%g—0. It follows that A } Ug —0. 
By the lemma % = 3%,. Hence we have proved 


THEOREM 33. If YL is atomic and semi-simple % is isomorphic to a direct 
sum of simple rings. 


Each component 2LY is a join of minimal right ideals a of Y. It is easy 
to see that these ideals are also minimal in YM. Hence by Theorem 29 each 
3 is isomorphic to a dense ring of finite-valued linear transformations. 


THE JOHNS HopPpKINS UNIVERSITY. 
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ON EVEN-DIMENSIONAL SKEW-METRIC SPACES AND THEIR 
GROUPS OF TRANSFORMATIONS.* 


By Hwa-Cuunea LEE. 


In a previous paper* we have studied the differential geometry of an 
even-dimensional space Len with a non-singular skew-symmetric fundamental 
tensor dag (%, B, y, p,; o, T=1,:-+,2n). We shall here discuss certain 
properties of the following transformation groups in such a space. 

In the group @ of analytic point-transformations of Lon there is a sub- 
group G, of conformal transformations each leaving the fundamental tensor 
invariant save for a non-vanishing factor. Referring Len to an arbitrary 
chosen coordinate system a conformal point transformation z—> y satisfies, 
by definition, the equation 
dy? dy” 


(1) Ydap(r) = apo(y) 


If the factor y is restricted to be constant, we have the smaller subgroup G2 of 
special conformal transformations. If we require y = 1, we obtain the further 
subgroup G; of automorphisms. Thus 


G,C G,C 4, C G. 


1. Let us first dispose of the case of an Lz. In this case (1) reduces 


| to the single equation 


dy* dy? dy? 


Since every J, is flat* we may refer it to a coordinate system in which each 
component of the fundamental tensor is constant,® thus = di2(y) = 
const. (540), so that 


(2) d(x, x?) 


Hence 


THEOREM 1. For an I», the group Gs of automorphisms ts isomorphic 


* Received July 24, 1944. 

*H. C, Lee, “ A kind of even-dimensional differential geometry and its application 
to exterior calculus,” American Journal of Mathematics, vol. 65 (1943), p. 433. 

* Loc. cit., p. 434. 

* Loc. cit., p. 434. 
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to the unimodular group, and the special conformal group G2 ts tsomorphic 
to the group of transformations of constant functional determinants. The 
conformal group G, is identical with the group G of all transformations, the 
factor y being entirely unrestricted. 


We shall supose 2n > 2. If we differentiate (1) with respect to 2’, 
permute a, B, y cyclically and add, we find 


dy? dy? oy 
(3) +o = re. aya (x )+ aap (2) = Kpor(y) iy 


where K,g, denotes the curvature tensor. Transvecting (3) by the inverse 
av of the fundamental tensor, the result is reducible, in consequence of (1), 
to the form 


0x a 
where K, denotes the curvature vector.® If we differentiate (4) with respect 


to 28, interchange a, 8 and subtract, we get 
(5) bap(x) = bpo(y) 


where b,g denotes the first conformal curvature tensor. (5) is a necessary 
and sufficient condition for (4) to be integrable when regarded as a differential 
equation in y. 

If Lo» is conformally flat,’ the first conformal curvature tensor vanishes’ 
so that (5) is satisfied identically. Thus (4) is integrable in y. In fact in 
this case a multiplier ¢ exists (determined save for a constant factor) such 
that ¢aag is the fundamental tensor of the corresponding flat space, and K, 
(being a gradient) is expressed in terms of this ¢ by ® 


Ka(2) — (2n—2) | 
whence (4) yields 
$(2) 
6 = const. ——— 
Hence 


THEOREM 2. Ina conformally flat space Lon (2n > 2) with fundamental 


* Loc. cit., p. 433. 7 Loe. cit., p. 436. 
5 Loc. cit., p. 434. ® Loc. cit., p. 436. 
® Loc, cit., p. 435. ® Loc. cit., p. 436. 
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tensor dag, the factor y of a conformal point-transformation x—>y is de- 
termined save for a mulliplicative constant: 


o(y) 


where p ts the multiplier (determined to within a constant factor) which 


makes dag the fundamental tensor of the corresponding flat space. 


In this conformally flat case, dag is expressible in terms of ¢ and of a 
quantity a, (determined save for an additive gradient) in the form ?° 
p(x) dap(t) = ag(x) — q(x). 
We can then reduce (1) to 


Cag(x) — ao(y) Caa(r) — ap(y) = 0, 


7B 
whence 
dy? 
Ca,(x) — ap(y) gradient. 


The gradient on the right may be dropped on account of the additive gradient 
in dq. Hence 


THEOREM 3. The conformal group G, of a conformally flat space Lon 
(2n > 2) consists of those point-transformations under which dg behaves like 
a covariant pseudovector (vector except for a multiplicative constant) : 


dy? 
Cag (x) = ap(y) 
where ag is the quantity (determined to within an additive gradient) in terms 
of which the fundamental tensor of the corresponding flat space is expressed 
in the form 

If Loy itself is flat, then ¢(2) =¢(y) —const. so that (6) implies 

¥=const. Conversely y const. implies also ¢ = const. and so Lo» is flat. 
Thus 


THEOREM 4. For a flat space Lon (2n > 2) the special conformal group - 


° Loc. cit., p. 436. 
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G, is identical with the conformal group G,. Conversely, if for a conformally 
flat space Lon (2n > 2) Ge is identical with G,, the space itself is flat. 


Before identifying the group G; of automorphisms of a flat Le, we first 
introduce the generalized Poisson parentheses, 


2. Consider in a general Le, two functions of position and form the 


expression 
OF, OF 
(7) (Fi, F2) a 
Evidently we have 
(Fi, P2) =— (F2, 


and it is also easy to verify the identity 
OF, OF, OF: 
(Fi, (F2, Fs)) + (F2, (Fs, + (Ps, (Fi, 


where KY denotes the contravariant associate of the curvature tensor: 


dave 
K por = a” qo —_ 
aap ax? dat 


The above identity reduces to the Jacobi relation 
(Fi, (Fz; Fs)) + (Fe, (F's, Fi)) + (Ps, (Fi, = 
when Lo, is flat, referred to any coordinate system whatever. 
For a flat Ds,, there exist coordinate systems in which the components 
of the fundamental tensor are constants.** Such coordinate systems may be 
called preferred. In particular a preferred coordinate system will be called 


canonical when 
(8) adap ( I a ( I 39 


where J and O denote the unit and zero matrices of order n respectively. 
In a flat Le, referred to a canonical coordinate system, (7), on account 
of (8), may be written in the form 


(9) (Fi, = 2 Opi Ox* Op; 


4=1 


where pj = x"? ({=1,:--+,n). This is the well known Poisson parenthesis. 
To consider the group G; of automorphisms of a flat Lz, put y—1 in 
(1) and write it in the equivalent form 


11 Loc. cit., p. 434. 
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dy? dy? 

= qb 

(10) (y) = a(x) 

The right-hand side, in canonical coordinates, is no other than the Poisson 
parenthesis (y’,y%). ‘Thus (10) in these coordinates decomposes into 


(11) =9, =8, (qi, = 0, (1,7 


where gi = y"™*, which are precisely Poisson’s conditions for a canonical 
transformation ** or in particular for a homogeneous contact transformation. 
Hence 


THEOREM 5. The group G; of automorphisms of a flat Len is isomorphic 
to the group of canonical transformations in 2n variables. 


We remark that (10) is equivalent to 


dy? dy? 


= Apo(y) axe 


(which is just (1) with y —1) whose right-hand side is, by (8), of the form 


which is Lagrange’s parenthesis. Thus the above equation in a canonical 
coordinate system represents Lagrange’s condition for a canonical transforma- 
tion. Similarly, (10) is also equivalent to each of the following two 


(x) = = (4 y) dy? dy? apa(y) = (Z) dy? dy? 


which in a canonical coordinate system represent the Poisson and Lagrange 
conditions respectively for the inverse transformation to be a canonical 
transformation. 


3. The special conformal group G, of any Ln has a very remarkable 
property in the following connection. Consider a system of curves in Lon 
defined by a system of ordinary differential equations of the form 

{Z, 


2 See, for example, E. Goursat, Legons sur Vintégration des équations aux dérivées 
partielles du premier ordre, 2nd ed. (1921), p. 387. 

18 See, for example, L. P. Eisenhart, Continuous groups of transformations (1933), 
p. 250. 
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where ¢ is a parameter and Z# is a given function of ¢ and of the z’s. We call 
such a system of curves a Hamiltonian congruence, and the function H the 
Hamiltonian of the congruence. These denominations are justified when Le» 
is flat and is referred to a canonical coordinate system, for in this case (12) 
may be written on account of (8) in the form 


det 0H dH 


dt, dp’ df act’ 


which are the well known Hamilton’s canonical equations in Analytical 
Mechanics. By a point-transformation in L2,, a Hamiltonian congruence of 
curves is transformed into a system of curves which is in general no longer 
a Hamiltonian congruence. We call a point transformation a Hamiltonian 
transformation if it transforms every Hamiltonian congruence into a Hamil- 
tonian congruence. The Hamiltonian transformations in Z2, evidently form 
a group, which we shall now show to be no other than the special conformal 
group G.. In other words 


THEOREM 6. The special conformal group G. of any space Lo, consists 
of the Hamiltonian transformations of the space. 

To prove this theorem consider a Hamiltonian transformation zy 
which, by definition, transforms (12) for an arbitrary H into an equation 
of the same form 


dy? 0H (y, t) 

Since 

dt dx dt 
by (12), (18) may be written 

6H dy 8 0H 

(14) dy? Aap ( ) (2) 


To obtain the conditions of integrability of (14) for H we first find by 
differentiation 


Subtracting from this equation the equation obtained from it by interchanging 
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o and +, and in the result putting the coefficients of 0H/00* and °H/dxvdx8 
(which latter is symmetric in 8, y) equal to zero (since H is arbitrary), we 
obtain the required conditions 


+a" (2) 


oy? 


(16) aop(y) 


To simplify these conditions we first transvect (16) by => an (A,p=1, 
+, 2n), finding that (16) is equivalent to 


(17) wya{a% (x) + (x) — {a2 (x) 8,7 + a” (x) = 0 


where for the moment we put 


dy’ dy? 
(18) ap = Apa(y) ax 
Contracting (17) for y and p we find 


which may be written 
(19) Wap = Yap (2). 


Since this satisfies (17), it is equivalent to (16). Replacing wag in (19) by 
its definition (18), (19) assumes the form 


(20) = Apo(y) oy" 


Because of (20) we may write (15) thus 


0 dxf 

which implies 
(21) vy = const. 


We see from (20) and (21) that the Hamiltonian transformation z— y in 
question is a special conformal transformation. 

Consider, conversely, a special conformal transformation r— y. It satis- 
fies (20) and (21) by definition. Since the latter are equivalent to (15) 
and (16), they are, therefore, the conditions of integrability of (14) for H. 


ap 
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Hence (14) furnishes an H corresponding to an arbitrarily given H. By 
means of (14), (12) is transformed into (13), whence the special conformal 
transformation z— y is a Hamiltonian transformation. Theorem 6 is proved, 
In consequence of (20) we may write (14) in the form 
dH 6H 
* Gat 


whence 
(22) H(y,t) —yH(z,t) + x(t). 


The additive term y(t), which is an arbitrary function of ¢, may be omitted, 
since a Hamiltonian may be increased by any function of ¢ without affecting | 


the corresponding Hamiltonian congruence. Hence 


THEOREM 7. When a Hamiltonian congruence is transformed by a special 
conformal transformation of factor y, the Hamiltonian of the congruence ts 


simply multiplied by this factor y. 
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